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Overview 


INTRODUCTION 


This manual is a guide to understanding and programming the mach64 accelerator. 
The maché64 accelerator is a fixed-function, 2D graphics acelerator. It is function- 
compatible but not register-compatibl with the mach32. 


Those seeking a general understanding of the features and functions of the mach64, 
or experienced mach64 programmers seeking a function summary, need only read 
Chapter 2, Programming Model. 


Very specific examples and techniques are described in Chapter 3, Simple Draw 
operations, Chapter 4, Advanced Topics I, and Chapter 5, Advanced Topics II. 


Major Features 


¢ Full draw capability at 1, 4, 8, 15, 16, and 32 bit per pixel color resolutions. 
Hardware-assisted draw functions available for packed 24 bit per pixel draw 
modes. 


* Spatial resolution of 640x480, 800x600, 1024x768, and 1280x1024. 
* Full read /writeable memory-mapped registers. 

* Up to 8M of memory. 

* 16x32 command FIFO. 


¢ Four-color (two fixed colors, complement, and transparent) hardware cursor 
of size up to 64x64. 


¢ Overscan. 


* Linear frame buffer locatable on 4M boundaries anywhere in a 4G system 
memory address space, and sizeable to 4M or 8M. 


* Paged frame buffer with two, 32k pages, pageable on 32k boundaries 
anywhere in the 8M video memory address space. 


© 1993 ATI Technologies Inc. PRG888GX0-01 
Proprietary and Confidential i 


INTRODUCTION 


Major Features (cont’d) 


¢ Draw functions include rectangle fill, line draw, bitblt, polygon boundary 
lines, polygon fill. 


¢ Generalized 2D patterns with rotation. 

¢ A linear memory mode for efficient memory management. 
¢ Efficient monochrome expansion. 

¢ Bit masking and scissoring capabilities. 

* Seventeen-function ALU for ful suite of logical ROPs. 

¢ Source compare logic suitable for alpha channel mixing. 

¢ Draw engine context loads for fast context switching. 

° Context chaining for grouping complex task lists. 

* Scrolling and panning on a virtual desktop. 


EEPROM hardware support for non-volatile storage. 


¢ Four-level hardware Display Power Management System (DPMS) mode 
support. 


¢ DAC power-down support. 
* Diagnostic test modes. 


Functional Enhancements to mach32 


* Full draw capability in 1bpp and 32bpp modes, and hardware assist in packed 
24bpp mode have been added. 


° Full 32-bit registers. Some register pairs may be written in a single 32-bit 
write. 


* Device coordinates have been expanded to -4096 to +4095 in the X direction, 
and -16384 to +16383 in the Y direction. 


¢ Bresenham parameters have been expanded from 12 bits to 18 bits. 
* Packed monochrome expansion. 
¢ The paged frame buffer is now pageable on 32k boundaries instead of 64k. 


¢ The source trajectory types, strictly-linear, general-pattern, and general- 
pattern-with-rotation, have been added. 


* Source compare. 
* Contexts and context chaining. 


¢ Four-level hardware Display Power Management System (DPMS) mode 
support. 


¢ DAC power-down support. 
° Diagnostic test modes. 
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Deletions From mach32 


Point-to-point line draw 

Line clip exception handling 
VNIB and VPIX type rectangles 
Short-stroke vectors 

Scan line draw 

Four compare functions 
Bounds accumulators 

CRTC shadow sets 


Host reads; screen-to-host transfers can still be accomplished by aperture 
reads 


Degree mode lines; Bresenham lines are still supported. 


Functional Differences From mach32 


Monochrome blits are now packed instead of sparse. 


Host writes are packed to 32 bits. Ibpp and 4bpp modes may be optionally 
aligned to a byte. 


Pixel consumption order from the host data register is only programmable in: 
Ibpp and 4bpp modes. 


Polygons are always inclusive on both edges. 


Polygons derive their boundary data from an implicit polygon source instead 
of an explicit monochrome source. 


Rectangular trajectories are specified in width and height instead of start and 
end. 


The ALU carry chain mask is set explicitly instead of implicitly from the pixel 
depth. 


Line drawing options do not affect rectangular trajectories and rectangle 
options do not affect line drawing trajectories. 


Destination side effects (tiling) are now programmable. 


Source pointer always returns to the original SRC_X, SRC_Y position after 
draw completion. 


Pixel depths, pitches, and offsets are independently specified for CRTC, 
source, destination, and host. 


Bresenham parameters have been expanded from 12 bits to 18 bits. 
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SAMPLE CODE ORGANIZATION 


Csource code All C source sample code is compatible with Microsoft's C compilers (version 5.1 

simnn his = and newer). All assembler source sample code is compatible with Microsoft's 
MASM compiler version 5.1. Each sample code example has its own program, 
based on the following template: 


Query the mach64 hardware § 


ont ms 


Enable the VGA aperture, 
linear aperture, or both 


lear the screen, if appl 


ie ose es as sy “ SS SSS nee 


emonstrate code example 


SSD oe ate PPPOE 


Wait for a carriage return where 
applicable, to see the results § 


Disable the VGA aperture, 
if it was enabled 


Re-initialize the VGA controller § 
for a VGA text mode ; 


Constants All constants used in the printed sample code are included in Appendix F, Sample 
Code, under the following header file names: 
ATIM64.INC 
ATIM64.H 
SAMPLE.H. 
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Utility Any utility functions not included with the printed sample code are listed in 
functions Appendix F, Sample Code, under the following source file names: 


ATTR.C 

DRAW.C 

GLOB.C 
HWCURSOR.C 
INIT.C 
MEMREG.C 
PALETTE.C 
WAIT.C 
MOVEMEM.ASM 
ROMCALLS.ASM 
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Programming Model 


INTRODUCTION 


The mach64 has two, distinct operating modes: 
° VGA mode 
° Accelerator mode. 


For more information on VGA programming, see the mach64 VGA Programmer's 
Guide. The accelerator provides the ability to draw into screen memory 
concurrently with the operation of the host CPU. 


mach64 DETECTION 


To detect the presence of a mach64: 

1. Check for the ATI product signature. 

2. Read SCRATCH_REGO and save its contents. 
3. Write the value 0x55555555 to SCRATCH_REGO. 
4 


Read back SCRATCH_REGO. If the value is not 0x55555555, a mach64 is not 
present. 


Repeat steps 3 and 4, using the value OxAAAAAAAA. 
6. Restore the saved value of SCRATCH_REGO. 


7. Read the CHIP_ID register for additional information on chip type, class, and 
revision. 


wi 


Additional configuration information can be obtained with a BIOS query call 
(functions 6, 7, 8, 9, and OxA). See Appendix A, BIOS Services, for more 
information. 
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mach64 DETECTION 


Sample code 


This is sample code that detects if a MACH 64 based video adapter is 
installed. See INIT.C and ROMCALLS.ASM for more details. 


detect_mach64 - determine if a mach6é4 based video pagigs is installed. 


This routine identifies if a mach6é4 based video adapter is installed. This 
is done by writing and reading the SCRATCH_REGO or SCRATCH _REG1 io based 
registers. These registers must be saved and restored since the mach64 
BIOS uses their contents. 


Returns YES_MACH64 if detected, NO_MACH64 if not. 


int detect_mach64 (void) 

{ 
unsigned long save_value; 
int result; 


// assume failure 
result = NO _MACH64; 


// check for ATI rom signature 
1f (is_ati_rom() == 0) 
{ 
return (NO_MACH64) ; 
} 


// save old value 
save_value = ior(iLoSCRATCH_REGO) ; 


// test odd bits for readability 
low(LoSCRATCH_REGO, 0x55555555); 
if (lor(LoSCRATCH REGO) == 0x55555555) 
{ 
// test even bits for readability 
Low(LoSCRATCH_REGO, Oxaaaaaaaa) ; 
1f (Llor(iLoSCRATCH_REGO) == Oxaaaaaaaa) 
{ 
result = YES_MACH64; 
} 
} 


// restore old value 
iow(LoSCRATCH_ REGO, save_value) ; 


return (result); 
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MODE SWITCHING 


a 


CRT controllers 


It is highly recommended that all mode switching be done by a BIOS service 
function call, rather than by manually setting the CRTC. The main reasons for doing 
this are: 


* Simplicity 
* The characteristics of the non-volatile storage device that stores mode and 
monitor information may not be known. Without monitor information, the 


only mode guaranteed to work on all analog monitors is 640x480 60 Hz non- 
interlaced. 


* CRTC compatibility with future devices is not guaranteed. 


Note that there are two logical CRT controllers (CRTCs) on the mach64. CRT 
parameters may be set independently on the VGA and accelerator CRTCs, and 
flipped from one to the other with a simple BIOS call. Screen memory may be shared 
to give one or the other device more screen memory, or it may be divided into two 
areas so that both logical devices may operate concurrently without having to save 
and restore screen data. 


The default memory boundary setting is set from the install program, but may be 
overridden by the application program by setting the appropriate bits in the 
MEM_CNTL register. In addition, some configurations of the mach64 may have the 
VGA disabled. 


BIOS Interface 


VGA modes are initialized with the standard INT 10h interface, as described in the 
mach64 VGA Programmer's Guide. See Appendix A, BIOS Services, for a complete 
definition of all BIOS services. 


To set the accelerator mode using the BIOS: 


1. Calculate the BIOS segment by reading SCRATCH_REG1 and computing: 
segment = (SCRATCH_REG1 & 0x7F) * 0x80 + 0xC000 


2. Call function 2 (load accelerator CRT parameters and set display mode) of the 


BIOS. 
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Sample code BMODE.DOC 


This is sample code to set an accelerator mode using the Mach 64 ROM 
services. See ROMCALLS.ASM for more details. 


3; Data section 


rom_addr dw 64h 
dw 0c000h 
db 0 


3; Code section 


PARM equ 6 3; passed parameters start at bp+6 for large model 


LOAD_AND_SET_MODE 


Load accelerator mode parameters and set accelerator mode. Input parameters 
are fetched from the stack. 


Inputs : WORD resolution code 
CH = 12h - 640x480 
CH = 6Ah - 800x600 
CH = 55h - 1024x768 
CH = 83h 1280x1024 


WORD pitch code 
CL [bits 7-6] = 0 - 1024 
CL [bits 7-6] = 1 - don’t change 
CL [bits 7-6] = 2 - pitch size = resolution width 


WORD deep color code 
CL [bits 3-0] = 1 - 4 bpp 
CL [bits 3-0] = 2 - 8 bpp 
CL [bits 3-0] = 3 - 15 bpp (555) 
CL [bits 3-0] = 4 - 16 bpp (565) 
CL [bits 3-0] = 5 - 24 bpp 
CL [bits 3-0] 6 - 32 bpp 


Outputs: Returns error code in ax 

AX = 0 - no error 
= 1 - function complete with error 
AX = 2 - function not supported 


Re 


me me me me Se Me Oe =e 06 me me me me =e te me me =e 06 ON we me 6M me me me m™e =e me me 


public load_and_set_mode 


= 
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load_and_set_mode proc far 


3; Create frame pointer 
push bp 
mov bp, sp 


3} save registers used 
push cx 


MODE SWITCHING 


3; setup parameters for call to ATI rom 


call rom_base 

mov rom_addr+2, ax 

mov ax, WORD PTR [bp+PARM] 
mov ch, al 

mov ax, WORD PTR [bp+PARM+2] 
shl al, 6 

mov cl, al 

mov ax, WORD PTR [bp+PARM+4] 
and al, 7 

or cl, al 

mov ax, 2 

mov rom_addr, 64h 

call DWORD PTR rom_addr 


3; setup error code in AL 
mov al, ah 
xor ah, ah 


3; restore saved registers 
pop cx 


7 remove frame pointer 


mov sp, bp 
pop bp 
ret 


load_and_set_mode endp 


SET_DISPLAY_MODE 


Inputs : WORD display mode 
CL = 0 - VGA 
CL = 1 - Accelerator 


Outputs: Returns error code in ax 
AX = 0 - no error 


i i | i, i a i a, | 


7 get rom segment in ax 


get resolution code 


=e 


get pitch code 


=e 


7 get deep color code 


function code 2 


me 


call ROM 


=e 


Set display to accelerator (after a call to LOAD _MODE_PARMS()) or VGA mode. 
Input parameters are fetched from the stack. 


AX = 1 - function complete with error 
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AX = 2 - function not supported 


me 6 


public set_display_mode 
set_display_mode proc far 

3; create frame pointer 

push bp 


mov bp, sp 


3; save registers used 
push cx 


3; setup parameters for call to ATI rom 


call rom_base 3; get rom segment in ax 
mov rom_addr+2, ax 

mov ax, WORD PTR [bp+PARM] 3; get display mode flag 
mov cl, al 

mov ax, 1 3; function code 1 

mov rom_addr, 64h 

call DWORD PTR rom_addr 3; call ROM 


3 setup error code in AL 
mov al, ah 
xor ah, ah 


3 restore saved registers 
pop cx 


3 remove frame pointer 


mov sp, bp 
pop bp 
ret 


set_display_mode endp 


ROM_BASE 
Retrieve base segment of MACH64 ROM. 
Inputs : none 


e 
a 
e 
a 
° 
a 
° 
’ 
e 
s 
° 
s 
e 
s 
e 
as 


Outputs: Returns base rom segment in ax (usually C000h or C800h) 


e 
, 


public rom_base 
rom_base proc far 


3; save registers used 
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push cx 
push ax 


3; retrieve rom segment address from MACH64 register 


mov dx, ioSCRATCH_REG1 
in al, dx 

and © al, 7Fh 

mov ah, 0 

mov cl, 7 

shl ax, cl 

add ax, 0C000h 


3; restore saved registers 


Pop dx 

pop cx 

ret 
rom_base endp 
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Manual Mode Switching 


ee 


Mode switching by manual means is not recommended. If for some reason this 
cannot be avoided, here’s how to do it: 


1. 


mach64 subsystems must always be configured with a non-volatile storage 
system for storing mode and monitor information. The application 
programmer must detect what kind of non-volatile storage is on board and 
access it appropriately to retrieve mode information. The most common 
configuration uses an EEPROM. See Appendix B, EEPROM Map, for the 
storage mapping of the EEPROM, and Chapter 5, Advanced Topics II for 
information on how to access the EEPROM. 


Set the accelerator CRT controller using the information retrieved in step 1. 


Detect the type of DAC used by reading the CONFIG_STATO register. 
Additional detection may be required for DACs that are upward-compatible 
with the supported DAC types. Consult the manufacturer’s DAC data sheet. 


Initialize the DAC to the appropriate pixel depth and mode using DAC_CNTL 
and DAC_REGS. Consult the appropriate manufacturer’s DAC data sheet. 
Information on how to program standard DACs to standard modes is 
provided in Chapter 5, Advanced Topics II. 


Switch from VGA mode to accelerator mode by setting the 
CRTC_EXT_DISP_EN bit in the CRTC_GEN_CNTL register. 


Designing a Custom CRT Mode 


PRG888GX0-01 
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The following illustration shows how the CRTC and overscan registers correspond 
to an actual video mode. The actual addressable displayed area is bounded by 
H_DISP and V_DISP. All registers are referenced to the upper left corner of the 
display area. 
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Vertical sync 
polarity and 
horizontal sync 


polarity are not 
shown in this 
illustration, 


V_TOTAL 


OvR_WD_BOTTOM 


OVR_WID_TOP $ 


H_TOTAL 


H_SYNC_START 


MODE SWITCHING 


OVR_WID_RIGHT OVR_WID_LEFT 
—> +> 


V_SYNC_START 


V_SYNC_WIDTH 


H_SYNC_WIDTH 


The relationships between CRT parameters and monitor parameters are listed in 


the following table: 


- pixel clock rate (Hz) 

. pixel clock period (sec) 

~ horizontal displayed resolution (pixels) 
- horizontal sync rate (Hz) 

' horizontal front porch (sec) 

- horizontal back porch (sec) 

- horizontal sync width (sec) 

- horizontal active time (sec) 

: horizontal blank time (sec) 

- vertical displayed resolution (pixels) 
- vertical sync rate (Hz) 

_ vertical front porch (sec) 

vertical back porch (sec) 

vertical sync width (sec) 

- vertical active time (sec) 

- vertical blank time (sec) 
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Hprs/ 8-1 

PCLK / Hsync/8 - 0.5 

Hsyp* PCLK/8 + 0.5 

(Hpes + Hpp* PCLK + 0.5) /8-1 


(H_DISP + 1)* 8 

PCLK / (H_TOTAL + 1)/8 

H_SYNC_WID * 8/PCLK 

(H_SYNC_STRT - H_DISP) * 8 / PCLK 

(H_TOTAL - H_SYNC_STRT - H_SYNC_WID) * 8 / PCLK 
(H_TOTAL - H_DISP) * 8 / PCLK 

(H_DISP + 1) * 8 / PCLK 

V_DISP +1 

HSYNC / (V_TOTAL + 1) 

V_SYNC_WID / HSYNC 

(V_SYNC_STRT - V_DISP) / HSYNC 

(V_TOTAL - V_SYNC_STRT - V_SYNC_WID) / HSYNC 
(V_TOTAL - V_DISP / HSYNC 

(V_DISP + 1) / HSYNC 


Pixel Clocks Pixel clocks may be chosen from the ATI1881X clock chip. Refer to Appendix D, 
Clock Chip Reference, for more details. 
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Example CRTC calculation for 640x480 60 Hz non-interlaced: CRTC .DOC 


Example CRTC calculation for 640x480 60 Hz non-interlaced: 


Hres = 640 

Hsync = 31.469 KHz 
Hswid = 3.813 usec 
Hfp = 0.953 usec 


Vres = 480 

Vsynec = 59.94 Hz 

Vewid = 0.064 msec 

Vip = 0.350 msec 

Pelk = 50.35 / 2 = 25.18 MHz (ATI1881X clock chip selection 4) 


Hpol = negative polarity 
Vpol = negative polarity 


H_TOTAL = (Pelk / Hayne / 8) + 0.5 
= (25.18 MHz / 31.469 KHz / 8) - 0.5 
= 99.52 = 63h 

H_DISP = Hres / 8 - 1 = 640 /8 -1 
= 79 = 4fh 


H_SYNC_STRT = (Hres + Hfp * Pclk + 0.5) / 8 - 1 
= (640 + 0.953 usec * 25.18 MHz + 0.5) / 8-1 
= 82.06 = 52h 


H_SYNC_WID = (Hswid * Pclk) / 8 + 0.5 
= (3.813 usec * 25.18 MHz) / 8 + 0.5 
= 12.50 = Och -> Och + 20h (- polarity) = 2ch 


V_TOTAL = (Hsync / Vsync) - 0.5 
= (31.469 KHz / 59.94 Hz) - 0.5 
= 524.51 = 20ch 


V_DISP = Vres - 1 = 479 = idfh 
V_SYNC_STRT = Vres + Vfp * Hsync + 0.5 


= 480 + 0.350 msec * 31.469 KHz - 0.5 
= 490.51 = leah 
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V_SYNC_WID (Vswid * Hsync) + 0.5 
(0.064 msec * 31.469 KHz) + 0.5 


2.51 = O2h -> 02h + 20h (- polarity) = 22h 


CLOCK_CNTL = 14h (clock chip selection 4, divide by 2) 


ACCELERATOR MODE 


In accelerator mode, there are two ways of accessing screen memory: 
* Memory aperture 
* Draw engine. 


Memory Aperture 


The host application may read or write directly to screen memory through a 
memory aperture (an aperture is an address space that maps directly to on-board 
memory). Accesses through the aperture provide no acceleration, and the speed of 
these accesses are generally bound by the speed of the host expansion bus. 


Draw Engine 


The second way of accessing the memory is to use the draw engine to write to it. 
The draw engine can do two things: 


* Rectangle fills 


¢ Lines. 
Destination These are known as destination trajectories. These trajectories may be filled with 
hafta pixel data from various sources. If the source data comes from graphics memory, 
this is called a bitblt (or blit) and follows one of four different flavors of source 
trajectory. 
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Engine Initialization 

Initializing or setting up a standard context for the Mach64 engine is done 
using the following attributes: 

* Engine reset and enabling 

¢ FIFO control, including clearing FIFO overflow errors 

¢ Source and destination pitch 

* Source and destination offset into video memory 

* Color depth 

° Host data control 

* Pattern data control 

¢ Line and rectangle control 

* Color source, mix, and compare control 

* Scissor settings 


e Write mask 


Sample code 


This is sample code to initialize the Mach 64 engine to a known context. The 
values here may be changed to suit a specific application. The engine 
registers are memory mapped and therefore require an enabled aperture to 
access them. See discussion on apertures. For more details, see INIT.C. 


[Ber nn nn ee en ee eee eee e 
INIT_ENGINE - set the MACH64 engine to a standard context. 


This routine configures the MACH64 engine to a known typical context. 
The context consists of: 


-engine reset and enabling 

-fifo control including clearing fifo overflow errors 
-source and destination pitch 

-source and destination offset into video memory 
-color depth 

~host data control 

-pattern data control 


-line and rectangle control 
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-color source, mix, and compare control 
-scissor settings 


-write mask 


void init_engine (void) 


{ 
unsigned long pitch_value, xres, yres; 


// Determine modal information from global mode structure - this 
// structure is initialized in function OPEN_MODE(). 


xres = (unsigned long) (modeinfo.xres) ; 
yres = (unsigned long) (modeinfo.yres) ; 
pitch_value = (unsigned long) (modeinfo.pitch) ; 


// Adjust pitch if the selected mode has a color depth of 24 bpp. 

if (modeinfo.bpp == 24) 

{ 
// In 24 bpp, the engine is actually in 8 bpp - the pitch is adjusted 
// by multiplying its value by 3 
pitch_value = pitch_value * 3; 


// Reset engine and clear any errors 
reset_engine(); 


// Ensure that vga page pointers are set to zero - the upper page 
// pointers are set to 1 to handle overflows in the lower page 
iLow(LoMEM _VGA_WP_SEL, 0x00010000); 

iow(LoMEM_VGA_RP_SEL, 0x00010000); 


// Setup standard engine context 
wait_for_fifo(14); 


regw(CONTEXT_MASK, OxFFFFFFFF) ; 


regw(DST_OFF_PITCH, (pitch_value / 8) << 22); 

regw(DST_Y_X, 0); 

regw(DST_HEIGHT, 0); 

regw(DST_BRES_ERR, 0); 

regw(DST_BRES_ INC, 0); 

regw(DST_BRES_ DEC, 0); 

regw(DST_CNTL, DST_LAST PEL | DST_Y_TOP_TO_BOTTOM | DST_X_LEFT_TO_RIGHT); 


regw(SRC_OFF_PITCH, (pitch_value / 8) << 22); 
regw(SRC_Y_ X, 0); 

regw(SRC_HEIGHT1_ WIDTH1, 0); 

regw(SRC_Y_X START, 0); 
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regw(SRC_HEIGHT2_WIDTH2, 0); 
regw(SRC_CNTL, SRC_LINE_X_ LEFT TO_RIGHT) ; 


wailt_for_fifo(13); 
regw(HOST_CNTL, 0); 


regw(PAT REGO, 0); 
regw(PAT_REG1, 0); 
regw(PAT_CNTL, 0); 


regw(SC_LEFT, 0); 

regw(SC_TOP, 0); 
regw(SC_BOTTOM, yres-1); 
regw(SC_RIGHT, pitch_value-1); 


regw(DP_BKGD_CLR, 0); 
regw(DP_FRGD_CLR, OxFFFFFFFF) ; 
regw(DP_WRITE_ MASK, OxFFFFFFFF) ; 
regw(DP_MIX, FRGD_MIX_S | BKGD_MIX_D); 
regw(DP_SRC, FRGD_SRC_FRGD CLR); 


wait_for_f1ifo(3); 


regw(CLR_CMP_CLR, 0); 
regw(CLR_CMP_MASK, OxFFFFFFFF) ; 
regw(CLR_CMP_CNTL, 0); 


switch (modeinfo.bpp) 
{ 
case 4 : init_4bpp(); break; 
case 8 : init_S8bpp(); break; 
case 16: 
if (modeinfo.depth == 555) 
{ 
init_15bpp (); // 555 color weighting 
} 
else 
{ 
init_16bpp(); // 565 color weighting 
} 
break; 
case 24: init_24bpp(); break; 
case 32: init _32bpp(); break; 
} 


wait_for_idle(); // insure engine is idle before leaving 


/* a ce in ee cu Se eh hm Oe ee ss ee eee ce oe Oe ee ee Oe eee cee eee ee ee ee 


RESET _ENGINE - reset engine and clear any FIFO errors 
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This function resets the GUI engine and clears any FIFO errors. 


(ee ea hs kes me ee meme ee wee eee eae meee mene ccm we me ee ee eee ee ee eee eee oe eee */ 
void reset_engine(void) 
{ 
// reset engine 
Low(LoGEN_TEST_CNTL, 0); 
Low(1oGEN_TEST_CNTL, GUI_ENGINE_ENABLE) ; 
// Ensure engine is not locked up by clearing any FIFO errors 
iow(LoBUS_CNTL, (lor(1LoBUS_CNTL) & OxffOOfff£) | Ox00ae0000) ; 
} 
/ Fe ee eae ee we ww ee ee es ee ie ee es em ee ee ee ee me oe eee ae oe we mem mt mn sm mt ems 0 oe ne ws ee me es oe oe 
INIT_4BPP - set the MACH64 engine to a 4bpp standard context. 
This routine is used in conjuction with INIT _ENGINE(). 
(oon mam tive ane aa em) ty ms We OOS Mase GS GUY tS ms GD fm Game fs ies cy ae em ems my es eee meme mm me oe ee coe ome ms see ms os me Oe os * / 
void init_4bpp (void) 
{ 
wait_for_fifo(2); 
regw(DP_PIX_ WIDTH, HOST_4BPP | SRC_4BPP | DST_4BPP | 
BYTE_ORDER_MSB_TO_LSB); 
regw(DP_CHAIN MASK, 0x8888); 
} 
/ De ce ee ae we ee we ee oe om om we oes es one an eae ne cs ae om ome ene ee te Oe tom Oe fae oe se OO OD OD OS OD OS Oe Om nD Oa ee Meme We Oe Om Ome Oe mee me ee ee oe ee ee ee 
INIT_8BPP - set the MACH64 engine to a 8bpp standard context. 
This routine is used in conjuction with INIT_ENGINE(). 
ee eee me ec ec ee ee ee tt ees ceo ee ee ee ee ce */ 
void init_8bpp (void) 
{ 
wait_for_fifo(2); 
regw(DP_PIX_ WIDTH, HOST 8BPP | SRC_8BPP | DST_8BPP | 
BYTE_ORDER_LSB_TO_MSB) ; 
regw(DP_ CHAIN MASK, 0x8080); 
} 
/ PO ae ee ee we we ee om om wae ome en see 0 0s ae ms ee ee ee ee ee ee me ne ee oe oe meso 
INIT_1I5BPP - set the MACH64 engine to a i5bpp standard context. 
This routine is used in conjuction with INIT _ENGINE(). 
(a mmm sey ns ae tame) DO OS See Se Oa em SOP aD Sem Gee Oe MD SG OD GE ma Om ie Os Om ne ee cea eee coe ee ee > te Oo es ee ee te */ 
void init_15bpp (void) 
{ 
wait_for_f1ifo(2); 
regw(DP_PIX WIDTH, HOST_1ISBPP | SRC_1I5BPP | DST_15BPP | 
BYTE_ORDER_LSB_TO_MSB) ; 
regw(DP_CHAIN_MASK, 0x4210); 
} 
/ Be te se ee a eee eee ee oe ee ow es me ees na ements ee eee mee ee ee eee eee ee eee eee ee ee ee ee ee 
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INIT_16BPP - set the MACH64 engine to a 16bpp standard context. 


This routine is used in conjuction with INIT_ENGINE(). 


ah imp Sm i ~s  s"t- y-n tmt imitp. i ee“en eee A SmmS <Geh‘lp.mSSssm omi,eGit l mn‘,e  ,Sem . i’ “it: Sm“,ie e */ 
void init_16bpp (void) 
{ 
wait_for_fifo(2); 
regw(DP_PIX_WIDTH, HOST_16BPP | SRC_16BPP | DST_1I6BPP | 
BYTE_ORDER_LSB_TO MSB); 
regw(DP_CHAIN MASK, 0x0410); 
} 
/* ee oe ee eae we we cas am ws ees oe oe en ee ee Os Oe Oe en sO Oe ee ee ee 
INIT_24BPP - set. the MACH64 engine to a 24bpp standard context. 
This routine is used in conjuction with INIT _ENGINE(). 
(ove ein smun me ont sean face ce ane coms alm Gam Come em mmm mm com um mh GED nS cams Om cums Sem cm cue ae mem es cae ems Oe Gem ts th ey ees, ms > em Oe ee sme om te aes ee */ 
void init_24bpp (void) 
{ 
wait_for_fifo(2); 
regw(DP_PIX_ WIDTH, HOST_8BPP | SRC_8BPP | DST_S8BPP | 
BYTE_ORDER_LSB_TO_MSB) ; 
regw(DP_CHAIN MASK, 0x8080); 
} 
/* wee ew wwe ew ew ee ew ee mee ee ee ewe we ee ee we ew ew ee we ew ee ee we eee ee ewe eee ee eee 
INIT_32BPP - set the MACH64 engine to a 32bpp standard context. 
This routine is used in conjuction with INIT_ENGINE(). 
(eae cee came as ee ome ae oe me ee oe ee ose ae 0 oh wo a a ee eee ee oes we ta en en eo fees ee een we */ 
void init_32bpp(void) 
{ 
wait_for_fifo(2); 
regw(DP_PIX WIDTH, HOST_32BPP | SRC_32BPP | DST_32BPP | 
BYTE_ORDER_LSB_TO_MSB) ; 
regw(DP_CHAIN MASK, 0x8080); 
} 
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THE LINEAR AND PAGED MEMORY APERTURES 


Memory on the mach64 may be directly accessed in one of three ways: 
* Standard paged 64k VGA aperture 
¢ Small dual paged apertures 
* Big aperture. 


Standard Paged 64k VGA Aperture 


If the VGA is enabled, and the mach64 is in VGA mode, the lower 1MB of memory 
may be accessed through the standard paged 64k VGA aperture. The segment base 
address of this aperture is either 0xA000 or 0xB000, depending on the video mode. 
Any memory writes via the VGA aperture are inhibited when the memory 
boundary is enabled. Read and write pages are set independently on 64k 
boundaries with this aperture type. 


For more information on how to page the 64k aperture, see the mach64 VGA 
Programmer’s Guide. 


Small Dual Paged Apertures 


If the mach64 is in accelerator mode, two small 32k apertures may optionally be 
enabled at segment base addresses 0xA000 and 0xA800. The read and write pages 
are set independently on 32k boundaries for each of the two apertures with the 
MEM_VGA_WP_SELand MEM_VGA_RP_SEL registers. This aperture mode is a 
type of VGA aperture configuration that is not available in standard VGA modes. 
If the memory boundary is enabled, writes to these apertures are inhibited. 


¢ These small apertures can access the full 8MB. 
* These small apertures may be enabled only if the VGA is enabled on the chip; 


= al otherwise, a memory address conflict would exist between the accelerator and 
the existing VGA. 

PCibus * These small apertures are not supported in the Peripheral Component 

implementation Interconnect (PCI) bus implementation. 
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Sample code ABLIT.DOC 


This is sample code to show usage of the VGA 32K apertures for transferring 
pixel data from one region to another. This particular example uses the 
apertures to transfer one line at a time to show the paging and page offset 
calculations. There are two write and two read 32K apertures. They may be 
used separately or combined to effectively make one 64K write aperture and 
one 64K read aperture. The combined approach has the advantage to handle 
‘overflows’ if the lower 32K aperture size is exceeded. 


Example code to perform a screen to screen blit using the small VGA 32K 
apertures. A multi-colored filled rectangle is drawn on screen as the 
source to be blited. This source is blited by copying the pixel data 

to another screen region using the VGA 32K apertures. 


This example assumes that the VGA controller is NOT disabled. 


Copyright (c) 1994 ATI Technologies Inc. All rights reserved 


#include <stdio.h> 

#include <stdlib.h> 
#include <string.h> 
#include <conio.h> 

#include <dos.h> 

#include “..\util\atim6é4.h” 
#include “..\util\sample.h” 


/* ee ee me em me oe es om om en in ee Os oe i es ee ee a a eee se ee ee me se oe en as 
(eee ee coe rm ch came cate em me mee care ce eM eh wt ne si css me eee le es mh ee en ee ee ae ee ce Mo es Om Ot it em eS es ees ee */ 
unsigned long getpageptr(unsigned long addr) 
{ 

unsigned long pageptr; 

pageptr = addr / 0x8000; 

return (pageptr) ; 
/* ee 

GETADDR - calculate the memory address from a given set of coordinates. 


A byte address is returned. 


unsigned long getaddr (int x, int y) 
{ 
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unsigned long addr, xadd; 


// calculate byte address from given coordinates (x, y) 
xadd = (unsigned long) x; 

addr (unsigned long) y; 

addr = (unsigned long) (addr * modeinfo.pitch) ; 

if (modeinfo.bpp == 4) 


{ 
addr = addr / 2; 
xadd = xadd / 2; 
} 
else 
{ 
addr = (unsigned long) (addr * (modeinfo.bpp / 8)); 
xadd = (unsigned long) (xadd * (modeinfo.bpp / 8)); 
} 


addr = addr + xadd; 


return (addr); 


/* eR we Ee ee Ee EON EE OEE EOE OR OOO EE eee eee ee 
GETOFFSET - calculate offset into 32K VGA aperture from the physical 
address and page pointer. 


unsigned long getoffset (unsigned long phyaddr, unsigned long pageptr) 
{ 
unsigned long ptraddr, offset; 


ptraddr = pageptr * 0x8000; 
offset = phyaddr - ptraddr; 


return (offset); 


APPBLIT - perform a screen to screen blit using small 32K VGA apertures. 


This routine uses the lower read 32K VGA aperture as the source buffer to 
read a line of screen memory. This is transferred to the lower write 32K 
VGA aperture which represents the destination buffer. The upper 32K VGA 
aperture is used large block transfers or for overflows from the lower 
32K aperture. 


It is assumed that the VGA aperture is enabled. 


aco eon owe ome ee se came tees tiny it) ett aah UD Gn ED CO Gow mu Gam Sum n> ume mim tis um On Gam Cons cm mms ms ee me es eee ae ee ea ee Oe Oa Oe OD OO Oe ee SD Un ee DD ee en ne Oe Oe ee ey es oe * / 
void appblit(int x1, int yl, int x2, int y2, int width, int height) 
{ 
unsigned long readaddr, writeaddr; 
unsigned long readpage, writepage; 
unsigned long readoffset, writeoffset; 
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int ySrc, yDst, index; 
int linesize; 


// calculate linesize in bytes 
1£ (modeinfo.bpp == 4) 
{ 
linesize = width / 2; 
} 
else 
{ 
linesize = width * (modeinfo.bpp / 8); 
} 


// transfer source to destination 
for (ySre = yl, yDst = y2; ySre < (yl + height); ySrc++, yDst++) 
{ 

// set read pointer to next source line address 

readaddr = getaddr(xl1, ySrc); 

readpage = getpageptr(readaddr) ; 

readoffset = getoffset(readaddr, readpage); 


// upper 32k VGA aperture pointer is used for overflow 
regw (MEM VGA _RP SEL, ((readpage+1) << 16) | readpage); 


// set write pointer to next destination line address 
writeaddr = getaddr(x2, yDst); 

writepage = getpageptr(writeaddr) ; 

writeoffset = getoffset(writeaddr, writepage); 


// upper 32k VGA aperture pointer is used for overflow 
regw(MEM VGA _WP_SEL, ((writepage+1) << 16) | writepage) ; 


// perform screen memory copy of one line 

for (index = 0; index < linesize; index++) 

{ 
*((unsigned char far *) (LOW_APERTURE_BASE + writeoffset + index)) = 
*((unsigned char far *) (LOW_APERTURE_BASE + readoffset + index) ); 


/* Main C program */ 


int main(void) 
{ 
// check if Mach64 adapter is installed 
if (detect_mach64() != YES_MACH64) 
{ 
printf (“Mach64 based adapter was not found.\n”); 
return (1); 
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// £111 global query structure by calling Mach 64 ROM 

1f (query_hardware() != NO_ERROR) 

{ 
printf (“Failed ROM call to query Mach6é4 hardware.\n”); 
return (1); 

} 


// check if Mach 64 VGA controller is enabled 
if (querydata.vga_type != VGA_ENABLE) 
{ 
printf(“This sample code example requires an enabled Mach 64 VGA 
controller. \n”); 
return (1); 
} 


// The VGA controller is normally set in planar mode. Data transfer 

// through the VGA aperture (low and high 32K pages) requires that the 
// VGA controller be set in a packed pixel mode where the pixel data 
// is arranged contigiously. 

set_packed_pixel(); 


// set an accelerator mode 
Lf (open_mode(MODE_640x480, PITCH _XRES, COLOR_DEPTH_8) != NO_ERROR) 
{ 
printf (“Error in setting display mode.\n”); 
return (1); 
} 


// Initialize standard engine context 
init_engine(); 
clear_screen(0, 0, modeinfo.xres, modeinfo.yres) ; 


// Draw some rectangles to act as a blit source 
set_fg_ color(get_color_code (YELLOW) ) ; 
draw_rectangle(0, 0, modeinfo.xres / 2, modeinfo.yres / 2); 


set_fg color(get_color_code(LIGHTRED) ) ; 
draw_rectangle((modeinfo.xres / 32), 
(modeinfo.yres / 24), 
(modeinfo.xres / 2) - (2 * (modeinfo.xres / 32)), 
(modeinfo.yres / 2) - (2 * (modeinfo.yres / 24))); 


set_fg_color(get_color_code(LIGHTGREEN) ) ; 

draw_rectangle((2 * (modeinfo.xres / 32)), 
(2 * (modeinfo.yres / 24)), 
(modeinfo.xres / 2) - (4 * (modeinfo.xres / 32)), 
(modeinfo.yres / 2) - (4 * (modeinfo.yres / 24))); 


set_fg_color(get_color_code(LIGHTBLUE) ) ; 
draw_rectangle((3 * (modeinfo.xres / 32)), 
(3 * (modeinfo.yres / 24)), 
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(modeinfo.xres / 2) - (6 * (modeinfo.xres / 32)), 
(modeinfo.yres / 2) - (6 * (modeinfo.yres / 24))); 


// Draw a DARKGRAY rectangle to show where the destination blit will be 
// drawn 
set_fg color(get_color_code(DARKGRAY) ) ; 
draw_rectangle(modeinfo.xres / 2, modeinfo.yres / 2, 
modeinfo.xres / 2, modeinfo.yres / 2); 


// wait for a carriage return 
getch(); 


// perform blit using small 32K VGA apertures 
appb1it(0, 0, modeinfo.xres / 2, modeinfo.yres / 2, 


modeinfo.xres / 2, modeinfo.yres / 2); 


// wait for a carriage return 
getch(); 


// @isable accelerator mode and switch back to VGA text mode 
close_mode() ; 


return (0); 
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Big Aperture 


If the mach64 is in accelerator mode, a big linear aperture may be optionally enabled 
to access the entire frame buffer. This aperture may be 4M or 8M in size, and is 
locatable on 4M or 8M boundaries, respectively, anywhere in the 4G address space. 
The memory boundary does not affect memory accesses through this aperture. 


The availability of this aperture is assured on all board configurations except ISA 
bus configurations. On an ISA system, the following two conditions must be met 


in order to use the big aperture: 
* The aperture must fit within a 16MB address space. 
¢ The aperture must not overlap host CPU memory. 


A typical set would be a system with 8MB of host CPU memory, with a big aperture 


of 4MB or 8MB, starting at an address of 8MB. 


Sample code LINAPP .DOC 


See source file LINAPP.C for more details. 


unsigned int buffer[200]; 


int main(void) 


{ 
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int config, memcntl, i, j; 
unsigned long appaddr; 
union REGS regs; 


// check if Mach6é4 adapter is installed 

if (detectmach64() != YES MACH64) 

{ 
printf (“Mach6é4 based adapter was not found.\n”); 
return (1); 

} 


// set an accelerator mode 
if (load_and_set_mode(MODE_640x480, PITCH_XRES, COLOR_DEPTH_8) 
{ 
printf (“Error in setting display mode.\n”); 
return (1); 
} 


// enable the linear memory aperture 


!= NO_ERROR) 
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outpw(ioCONFIG_CNTL, inpw(ioCONFIG_CNTL) | APERTURE_4M_ENABLE) ; 


// set engine context 
init_engine(MODE_ 640x480, PITCH XRES, COLOR_DEPTH_ 8); 
clear_screen(0, 0, 640, 480); 


// get linear memory aperture base address 

config = inpw(ioCONFIG_CNTL) ; 

appaddr = (unsigned long) ((config & Ox3ff0) >> 4); 

appaddr = appaddr << 22; 

1f (appaddr == 0) 

{ 
// go back to VGA mode 
set_display_mode(VGA_MODE) ; 


// reinitialize VGA 
regs.X.ax = 3; 
int86(0x10, &regs, &regs); 


printf(”“Error - linear memory aperture has a zero address.\n”); 
return (1); 
} 


// insure that memory is shared between VGA and acelerator for full 
// memory access 

mementl = inpw(iloMEM_CNTL+2) ; 

outpw(LoMEM CNTL+2, 0); 


// £111 buffer with data and write to screen memory through linear aperture 
for (i = 0; i < 200; 14+) 
{ 
buffer[i] = 1; 
} 
movemem( (void far *)buffer, appaddr, 200, MEM WRITE); 


// clear buffer and read screen memory back 
for (1 = 0; 1 < 200; 1++) 
{ 


buffer [i] 0; 


} 
movemem( (void far *)buffer, appaddr, 200, MEM READ); 


// xvrestore MEM CNTL register 
outpw(LoMEM CNTL+2, memcnt1) ; 


// go back to VGA mode 
set_display_mode(VGA_MODE) ; 


// Since the accelerator and VGA controllers share the same video memory, 
// it is necessary to reinitialize the VGA text mode before exiting. Text 
// mode 3 is used here. 

regs.x.ax = 3; 
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int86(0x10, &regs, &regs); 


// dump buffer to VGA screen 
printf(“\nLINEAR APERTURE read dump - values should be 0 to C7h\n”); 
j = 0; 
for (1 = 0; i < 200; i++) 
{ 
printt(“%04X “”, buffer[i]); 
j++; 
if (j > 9) 
{ 
j = 0; 
printf (“\n”); 
} 
} 
printft(“\n"”); 


return (0); 
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REGISTER MAPPING 


All registers not associated with the draw engine are I/O mapped, and all except 

CONFIG_CNTL have memory mapped register aliases. AllI/O mapped addresses 
have a base of 0x2EC (in the ISA style of sparse I/O decoding, the base address is 
the lower ten bits, and the upper six bits are used for modifier bits — in this case, 

they are register selects) and are 32 bits wide. 


Allregisters, except for CONFIG_CNTL,arememory-mapped. Allregisters, except 
for DAC_REGS (they are four, eight-bit registers), are 32 bits wide. 


¢ If the small apertures are enabled, the registers may be accessed through a 
1K area at segment:offset of 0xB000:0xFCO0O. 


¢ If the big aperture is enabled, the registers occupy the address space located 
at the base address of the aperture, plus an offset of Ox3FFC00 for a 4M 
aperture or 0x7FFCO0 for a 8M aperture configuration. 


Sample code =REGMAP.DOC 


This is sample code to show how to access the memory mapped registers using 
the VGA and linear apertures. 


Example code to write and read a memory mapped register using the VGA 
aperture, 4M linear aperture size, or 8M linear aperture size. For more 
details, see functions REGW() and REGR() in MEMREG.C. 


Note that for GUI register access with the linear aperture, the aperture 
address must be valid. The INSTALL program should be used to ENABLE the 
linear aperture. 


Failures: 


- VGA aperture register access will fail if the VGA controller is disabled 


- Linear aperture register access will fail if the memory mapped 
register area is overlapping extended memory or resides at an invalid 
address 


Copyright (c) 1993 ATI Technologies Inc. All rights reserved 


SE SSS SE ESE ESE BEEBE SSS SS SS SSE SESS ESS PB SE SSS SS SS SSS SSE SSS SS SSS SSS ETS Sse / 
#include <stdio.h> 

#include <stdlib.h> 

#include <string.h> 

#include <conio.h> 
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#include “..\util\atim64.h” 
#include “..\util\sample.h” 


int main(void) 


{ 
int config; 
unsigned long regval; 
// save config value 
config = inpw(iLoCONFIG_CNTL) ; 
// £4111 global query structure by calling Mach 64 ROM 
1£ (query_hardware() != NO_ERROR) 
{ 
printf(“Failed ROM call to query Mach64 hardware.\n”); 
return (1); 
} 
// Enable GUI engine to turn on GUI memory mapped register access 
reset_engine(); 
// Insure that fifo is empty before starting - enable linear aperture 
// to insure that the wait_for_idle() can read the FIFO_STAT and 
// GUI_STAT registers. The VGA aperture may not be available. This is 
// necessary since the register under test (PAT_REGO) is a GUI memory 
// mapped register. 
enable_linear_aperture (APERTURE_4M_ ENABLE) ; 
wait_for_idle(); 
|| wer rrr rr rr rr rn en ee ce ree nner ren meme nen ene nesseen= 
// Use the VGA aperture to access the GUI registers. The base address 
// is BO000:FCOOh. Each register occupies 4 bytes. 
// Use VGA aperture if the VGA controller is enabled 
if (querydata.vga_type == VGA_ENABLE) 
{ 
// enable the VGA aperture for register access 
disable linear _aperture(); 
enable_vga_aperture(); 
// Use this function to insure regw() and regr() work correctly 
update_aperture_status(); 
// write and read the PAT_REGO register using the VGA aperture 
regval = 0x555aa5aa; 
printf (“VGA aperture enabled : write value = %081X, read value =“, regval); 
regw(PAT REGO, regval); 
regval = regr(PAT_REGO); 
printf (“%081X\n”, regval); 
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} 
else 
{ 
printf(“VGA controller is disabled.\n”); 


[| crm mmm mm tre rr rn re en cer er eee ee nn rn ee er ne ee re eww entree re 
// Use the 4Meg linear aperture size to access the GUI registers. The 
// base address is the aperture address + O0x3FFCO0h. Each register 

// occupies 

// 4 bytes. 


// enable the 4M linear memory aperture without distribing address 
disable_vga_aperture() ; 
enable_linear_ aperture (APERTURE_4M ENABLE) ; 


// Use this function to insure regw() and regr() work correctly 
update_aperture_status(); 


// write and read the PAT REGO register using the 4Meg linear aperture 
regval = 0xaaa55a55; 
printf (“4M Linear aperture enabled: write value = %081X, read value = “", regval); 


regw(PAT REGO, regval); 


regval = regr(PAT_REGO); 
printf (“%081X\n”, regval); 


ff i= SS Se SSS Sse Se eee eee SSeS eS Seer ae ee essere 
// Use the 8Meg linear aperture size to access the GUI registers. The 
// base address is the aperture address + Ox7FFCO0Oh. Each register 

// occupies 4 bytes. 


// enable the 8M linear memory aperture without distribing address 
enable linear aperture (APERTURE_8M_ENABLE) ; 


// Use this function to insure regw() and regr() work correctly 
update_aperture_status(); 


// write and read the PAT REGO register using the 8Meg linear aperture 
regval = 0x5a55aaa5; 
printf (“8M Linear aperture enabled: write value = %081X, read value = ”, regval); 
' regw(PAT_REGO, regval); 


regval = regr(PAT_REGO); 
printf (“%081X\n”, regval); 


// restore configuration 
outpw(LoCONFIG_CNTL, config) ; 


return (0); 
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THE COMMAND FIFO 


All draw engine registers are memory-mapped with DWORD offsets greater than 
or equal to 0x40. All writes to draw engine registers are automatically routed 
through a 32-bit-wide, 16-entry-deep, command FIFO. All entries are consumed 
in the same order as written. 


* Note that host data registers do not generate extra wait states as did the 
mach32, and complete FIFO discipline is required for these registers as well. 


° Register reads are not FIFOed in any fashion. 
* Register writes to registers with DWORD offsets less than 0x40 are not FIFOed. 


Waiting for Sufficient FIFO Entries 


te Prior to any writes to any draw engine register, it is essential to check the state of 
the command FIFO to ensure that enough FIFO entries are available. Failure to do 
SO may cause the draw engine to lock. C source code that waits for n free entries is 
shown below: 


Sample code =CHKFIFO.DOC 


This is sample code to wait for a specific number of empty fifo entries. When 
a GUI memory mapped register is written to, it goes through the FIFO. If the 
FIFO fills up (16 or more writes without waiting for empty FIFO entries), the 
FIFO error bit will be set and all subsequent writes will be ignored until 
the condition is cleared (see RESET ENGINE() in INIT.C). It should be noted 
that since the GUI memory mapped registers are FIFOed during a write 
operation, the write width must be 32 bits. It is not necessary to check for 
empty fifo entries when reading GUI memory mapped registers or accessing 
non-GUI registers. 


/* rae ee mem ee ee meee oes ce ee cr eeu es tee ee ec ee ats cme cee mee ce sme cee eee ee eee eee ee ee ee ae 
WAIT _FOR_FIFO - wait for n empty FIFO entries. 


The FIFO is 16 entries by 32 bits wide. 


void wait_for_fifo(int entries) 
{ 
while ((regr(FIFO_STAT) & Oxffff) > ((unsigned int) (0x8000 >> entries) )); 


} 
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Waiting for Engine Idle 


There are two cases where the application must wait for the draw engine to become idle: 


°* The first case occurs when the application is depending on the draw engine to 
update a register or bit field (such as DST_X, or the scissor status bits in the 
GUL STAT register). The application must ensure idleness so that those 
registers will not be read back while in an intermediate state. 


The second case occurs when some draw operations are being performed by the 
draw engine, and memory is being accessed through an aperture, and spatial 
mutual exclusivity cannot be guaranteed. If an engine write and an aperture 
write are occurring in the same region, the pixel that lands on top will not be 
deterministic. If an engine write and an aperture read are occurring in the same 
region, the pixel that is read back may or may not be the pixel just drawn. 


Sample code CHKIDLE .DOC 


This is sample code to wait until the engine is idle. If the engine FIFO is 
empty, this does not imply that the engine is idle. An idle engine indicates 
that all operations have completed. When a GUI engine register is read, the 
engine should be idle to insure that the register is updated before the read 
takes place. 


/* ee ee ee ee ee ee ere 


void wait_for_idle(void) 

{ 
// wait for empty fifo 
wait_for_fifo(16); 


// wait until the engine is idle 
while ((regr(GUI_STAT) & 1) != 0) ; 
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LOGICAL PIXEL DATA PATH 


Monochrome to Two-color Color Expansion Circuit 


DP_MONO_SRC 


DP_FRGD_SRC 


DP_BKGD_COLOR 
DP_FRGD_COLOR 1 1,4,8,16,32 
Color Host 
Color Bit 
Color Patter: 


DP_FRGD_MIX 
DP_BKGD_MIX 


DP_BKGD_SRC 


1,4,8,16,32 
ALU Result 


DP_BKGD_COLOR Destination Read To Color Compare Circuit 
DP_FRGD_COLOR 


Color Host HE: Color Source 
Color Blit 4 1,4,8,16,32 


Color Patter 


Color Compare Circuit 
COLOR_CMP_SRC 
Destination Read CLR_CMP_FCN 


Color Source 


CLR_CMP_CLR 


1,4,8,16,32 
ALU Result 1,4,8,16,32 
To Screen Memory 
Destination Read 


Note: These illustrations are VERY important in understanding the mach64. 
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° For each pixel, a mono source bit is consumed. If it is 1’, the foreground color 
source and foreground mix are used; otherwise, the background color source 
and background mix are used. 


¢ The color source is mixed with the destination in the ALU, and passed on to 
the compare logic. If the color compare source is set for destination, the 
destination pixel is compared against the compare color; otherwise, the source 
color is compared against the compare color. 


* If the result of the comparison is FALSE, the mixed color is written to the 
destination; otherwise, the destination color is written to the destination. 


The pixel data path has inherent monochrome-to-color expansion characteristics. 
For instance, text operations — whose source data is monochrome — can be 
expanded to two colors by writing the text data to graphics memory, setting 
DP_MONO_SRC to Mono_Blit, DP_FRGD_SRC to DP_FRGD_CLR, and 
DP_BKGD_SRC to DP_BKGD_CLR and doing a blit operation. If monochrome 
expansion is not desired, simply set the DP_MONO_SRC to Always_’1’ and only 
the foreground color source will be used. 


The color compare circuit can be used for functions such as transparent blits, where 
a color source can be compared to a color that is used to determine whether a pixel 
is written to the screen or not. If no color comparison features are desired, simply 
set the color compare function to FALSE. 


All pattern sources in the above diagram refer to fixed patterns. All blit sources 
refer to a blit source trajectory, as described in Programming Model, Trajectories. 


Every draw engine operation must color-expand a monochrome source into two 
color sources. If only a single color source is desired, the monochrome source is set 
to Always_’1’ and only the foreground source and foreground mix are used. 


Color_Host means that data is to be written through the HOST_DATA register. If 
Color_Host is selected for one of the color sources, host data will be consumed for 
every pixel, regardless of whether the color MUX selects that color source. Note 
that the host pixel depth must be set to the same pixel depth as the destination pixel 
depth with the DP_PIX_WIDTH register. 


Also, the HOST_CNTL register controls consumption of host data on Ibpp and 4bpp 
data. If the HOST_BYTE_ALIGN bit is set, host consumption advances to the nearest 
byte boundary whenever the destination trajectory advances in the Y direction. 
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Color_Blit source selects one of the four possible source trajectories (see Trajectories 
on page 2-37). The option bits in the SRC_CNTL register determine which of the 
four trajectories is to be used. If Color_Blit is selected as one of the color sources, 
pixels from this source will always be consumed for each destination pixel 
consumed, regardless of whether the color MUX selects that source. 


Color_Pattern source are the 4x2 and 8x1 fixed color patterns. These patterns are 
only useful in 8bpp draw modes. The PAT_CNTL register is used to select which 
of the color patterns is to be used and the PAT_REG registers define the pattern itself. 


Always_’1’ monochrome source forces the foreground source and foreground mix 
to always be used. 


Mono_Pattern monochrome source is the 8x8 fixed mono pattern. PAT_CNTL 
enables the mono pattern and the PAT_REG registers define it. 


Mono_Host is much the same as Color_Host data, except that the host pixel depth 
must be set to monochrome with DP_PIX_WIDTH, and that monochrome data is 
color expanded into foreground and background color sources. 


Mono_Blit is much the same as Color_Blit except that the source pixel depth must 
be set to monochrome with the DP_PIX_WIDTH register, and the monochrome data 
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Host Data Consumption 


The following tables illustrate the order in which pixels are consumed from the 
HOST_DATA register. The shaded numbers indicate the bit position within the 
HOST_DATA register. The numbers in the table indicate the order of pixel 
consumption, starting from Zero. 


Ee 
Monochrome or 1bpp, right-to-left, 7{6{5]}4]3]2]1{0]15]}14]13]12}11] 10] 9 | 8 | 
DP_BYTE_PIX_ORDER@DP_PIX_WIDTH=0 | 23 | 22 | 21 | 20] 19 | 18| 17 | 16 | 31 | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 
Monochrome or 1bpp, left-to-right, | 31 | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22| 21 | 20 19| 18 | 17 | 16 | 
DP_BYTE_PIX_ORDER@DP_PIX_WIDTH=1 15|14/13/12/11}10] 9 a} 7] 6] 5]}4]3}2] 1] 0° 
Monochrome or 1bpp, right-to-left, fro}1{2]/3]4]}5}]6]7] 8] 9 |10]11{12]13] 14] 15 
DP_BYTE_PIX_ORDER@DP_PIX_WIDTH=1 


Abpp, left-to-right, 


Abpp, right-to-left, 1 3 7 5 A 7 
DP_BYTE_PIX_ORDER@DP_PIX_WIDTH=0 
bpp, left-to-right, 
DP_BYTE_PIX_ORDER@DP_PIX_WIDTH=1 2 ‘ 2 5 : 
Abpp, right-to-left, 
DP_BYTE_PIX_ORDER@DP_PIX_WIDTH=1 é : ‘ i ‘ 
spp lefttovight oe oe sal 
Bbpp, tight to left a a a iat ee ee ee 


16bpp,lefe-to-ight Sask DEE Son TEN 
jokes nance (ca Ren TORE SNE) 


Notes: 
° Host data consumption for 32 bits per pixel is self-evident. 
¢ Host data consumption for line draws is the same as for left-to-right 
trajectories. 
* Pixel consumption in 15bpp modes is the same as 16bpp modes. 


° Packed 24bpp mode is essentially 8bpp mode. R, G, and B component data 
must be fed in individually in 8-bit chunks. 


* The HOST_BYTE_ALIGN@HOST_CNTL bit may affect pixel consumption for 
Ibpp and 4bpp modes. When set, pixel consumption advances to the next 
nearest byte boundary whenever the destination advances in the Y direction. 
Line draws are unaffected. 
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Pattern Consumption 


Pattern consumption for the various fixed patterns is shown in the tables below. 
PO and P1 indicate PAT_REGO and PAT_REG1, respectively. The number(s) in 
parentheses are the bits within the pattern registers, which are used according to 
the destination pixel location. 


| Po) | Po) | Pos) | Pom) | Po) | Pow) | Pot) 
3 


) PO(4) ) (2) 
PO(30) 
) P1( P1(4) (3) ) 
P 1 P1(10 
1 
1 2 


( P 
(15) | P14) 

P 
( 


(1) 
0(9) 
) ) 
PO( ) (25) 
) 
pia) | P1(13) 1(9) 


Po | roo | rom | roa) | Poa) | row) | ros) | ro) | Pow) 
Pino 
Sn SS OC CC TC 


Bee Te P0(7:0) PO(15:8) PO(23:16) PO(31:24) 
aS eee P1(7:0) P1(15:8) P1(23:16) P1(31:24) 


PO(7:0) PO(15:8) PO(23:16) PO(31:17) P1(7:0) P1(15:8) P1(23:16) P1(31:24) 
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TRAJECTORIES 


TRAJECTORIES 


This table summarizes all possible trajectories. Source and destination trajectories 
are consumed independently, with the exception of source direction, which tracks 
the destination direction for blits. 


The Trajectory Registers column specifies all the registers that need to be initialized 
for the desired trajectory. 


The Enable/Initiate column indicates the bits that must be set to achieve the desired 
trajectory, or the register to write to in order to initiate the draw operation. All 
rectangular trajectories are X major. 


SRC_OFFSET 

if destination is line, then 
SRC_LINE_X_DIR@SRC_CNTL, else 
DST_X_DIR@DST_CNTL 


SRC_OFFSET, SRC_PITCH, SRC_X, 
SRC_Y, SRC_WIDTH1 

if destination is line, then also 
SRC_LINE_X_DIR@SRC_CNTL 


SRC_OFFSET, SRC_PITCH, SRC_X, 
SRC_Y, SRC_WIDTH1, SRC_HEIGHT1 
If destination is line, then also 
SRC_LINE_X_DIR@SRC_CNTL 


SRC_OFFSET, SRC_PITCH, SRC_X, 
SRC_Y, SRC_WIDTH1, SRC_HEIGHT1, 
SRC_X_START, SRC_Y_START, 
SRC_WIDTH2, SRC_HEIGHT2 

If destination is line, then also 
SRC_LINE_X_DIR@SRC_CNTL 


DST_OFFSET, DST_PITCH, DST_X, 
DST_Y, DST_WIDTH, DST_HEIGHT, 
DST_X_DIR@DST_CNTL, 
DST_Y_DIR@DST_CNTL 


DST_OFFSET, DST_PITCH, DST_X, 

DST_Y, DST_BRES_LNTH, 

DST_BRES_ERR, DST_BRES_INC, 

DST_BRES_DEC, DST_BRES_LNTH 
DST_X_DIR@DST_CNTL, 

DST_Y_DIR@DST_CNTL, 

DST_Y_MAJOR@DST_CNTL 


SRC_LINEAR@SRC_CNTL== 


SRC_PATT_ENA@SRC_CNTL== 
SRC_PATT_ROT@SRC_CNTL== 
SRC_LINEAR@SRC_CNTL==0 


SRC_PATT_ENA@SRC_CNTL==1 | Enables | 
SRC_PATT_ROT@SRC_CNTL== 
SRC_LINEAR@SRC_CNTL== 


SRC_PATT_ENA@SRC_CNTL== 
SRC_PATT_ROT@SRC_CNTL==1 
SRC_LINEAR@SRC_CNTL==0 


DST_WIDTH or 
DST_HEIGHT_WIDTH or 
DST_X_WIDTH 


Destination 
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TRAJECTORIES 


_ Destination Trajectory 1, Rectangular 


DST_OFFSET 


\ 


DST_X 


, 


DSTY —> 


—— pDsT_x_coirR 


DST_Y_DIR DST_HEIGHT 


DST_WIDTH 
$$ _» 


DST_PITCH 


Initiator: DST_WIDTH | | DST_.HEIGHT_WIDTH | |! DST_X_WIDTH 
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TRAJECTORIES 


Destination Trajectory 2, Line 


Initiator: DST_BRES_LNTH 
Comments: The bresenham parameters are calculated as follows... 
DST_BRES_ERR = 2* min(1| dx!,| dy!) - max(/ dx!,I dy!) 
DST_BRES_INC = 2* min(1 dx!,! dy!) 
DST_BRES_DEC = 2* [min(| dx! ,| dy!) - max(] dx!,1 dy!)] 
DST_BRES_LNTH = max(1 dx!,| dy!) +1 
The line drawing pseudocode is: 
if (DST_BRES_ERR < 0) { 
do axial step 
DST_BRES_ERR += DST_BRES_INC 
} else { 
do diagonal step 
DST_BRES_ERR += DST_BRES_DEC 
} 
where the axial and diagonal step directions are determined by the three octant bits, 
The octant bits reside in DST_CNTL. 
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TRAJECTORIES 


Source Trajectory 1, Strictly Linear 


SRC_OFFSET 
SRC_X_DIR 
————»> 
Criterion: SRC_LINEAR@SRC_CNTL==1 
Comments: Source offset and SRC_X_DIR are the only parameters used to set up the source 


trajectory. Pixels are consumed ina strictly linear fashion inmemory. SRC_X_DIR 
tracks DST_X_DIR@DST_CNTL in the case of blits. For lines, SRC_X_DIR equals 
SRC_LINE_X_DIR@SRC_CNTL. SRC_X_DIR should be set to go from left-to- 
right. 


Source Trajectory 2, Unbounded Y 


SRC_OFFSET 


\ 


SRC_X 


’ 


SRCY —> 
——> SRC_X_DIR 


| SRC_Y_DIR 
| | 
| 


SRC_WIDTHI 


et 
SRC_PITCH 
—-—-»+ 
Criterion: SRC_PATT_EN@SRC_CNTL==0 && SRC_PATT_ROT@SRC_CNTL=0 && 


SRC_LINEAR@SRC_CNTL==0 


Comments: If the destination trajectory is rectangular, SRC_X_DIR and SRC_Y_DIR track 
DST_X_DIR@DST_CNTL and DST_Y_DIR@DST_CNTL. For lines, 
SRC_LINE_X_DIR@SRC_CNTL is used and the source trajectory does not 
advance in the Y direction. 
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TRAJECTORIES 


Source Trajectory 3, General Pattern 


SRC_OFFSET 


——> SRC_X_DIR 


| SRC_Y_DIR SRC_HEIGHTI 


SRC_WIDTH1 
SRC_PITCH 
—-—-> 
Criterion: SRC_PATT_EN@SRC_CNTL==1 && SRC_PATT_ROT@SRC_CNTL==0 && 


SRC_LINEAR@SRC_CNTL==0 


Comments: If the destination trajectory is rectangular, SRC_X_DIR and SRC_Y_DIR track 
DST_X_DIR@DST_CNTL and DST_Y_DIR@DST_CNTL. For lines, 
SRC_LINE_X_DIR@SRC_CNTL is used and the source trajectory does not 
advance in the Y direction. 
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TRAJECTORIES 


Source Trajectory 4, General Pattern With Rotation 


SRC_OFFSET 


\ 


SRC_X_START 


SRC_LY_START —> 
——> SRC_X_DIR 


| SRC_Y_DIR SRC_X SRC_HEIGHT2 


‘ 


SCY 


SRC_HEIGHTI 


SRC_WIDTH2 
SRC_WIDTHI 
—— 


SRC_PITCH 


Criterion: SRC_PATT_ROT@SRC_CNTL==1 && SRC_LINEAR@SRC_CNTL== 


Comments: If the destination trajectory is rectangular, SRC_X_DIR and SRC_Y_DIR track 
DST_X_DIR@DST_CNTL and DST_Y_DIR@DST_CNTL. For lines, 
SRC_LINE_X_DIR@SRC_CNTL is used and the source trajectory does not 
advance in the Y direction. 
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SIDE EFFECTS 


Trajectory Modifier 1, SRC_BYTE_ALIGN 


When SRC_BYTE_ALIGN@SRC_CNTL is set, the source pointer skips to the next 
byte boundary when the destination trajectory advances in the Y direction. There 
is a similar bit for host data called HOST_BYTE_ALIGN@HOST_CNTL. These bits 
are only meaningful for Ibpp or 4bpp data. 


Trajectory Modifier 2, DST_POLYGON_ENA 


The DST_POLYGON_ENA affects both lines and blits. 


When drawing a line, only a single pixel is drawn per scan line (this only affects X 
major lines). Horizontal lines are not drawn. Lines whose trajectory goes left of 
the left scissor are saturated to the left scissor. 


When blitting, at the beginning of each destination line, an internal polygon fill flag 
is reset. If the polygon fill flag is reset, drawing is inhibited at the destination. For 
each pixel, an implicit Ibpp polygon boundary source (this is neither a 
monochrome nor a color source, but an implicit third source) is read. If the result 
is ‘1’ (a polygon edge) the polygon fill flag is toggled. Both left and right edges of 
the polygon are inclusive. 


Trajectory Modifier 3, DP_BYTE_PIX_ORDER 


The DP_BYTE_PIX_ORDER@DP_PIX_WIDTH bit affects the pixel order of Ibpp 
and 4 bpp data within a byte. This affects the source area, destination area, and host 
data consumption. When set, pixel order proceeds from least significant bit or 
nibble to most significant bit or nibble within a byte. The bytewise order is 
unaffected. See Host Data Consumption on page 2-35 for a graphical view of the pixel 
ordering. 


SIDE EFFECTS 


* The source pointer is always reset to the original SRC_X, SRC_Y after 
completion of a draw operation. 


* The destination pointer is set according to the DST_X_TILE and DST_Y_TILE 
bits after completion of a blit operation. If DST_X_TILE is set, then DST_X = 
original_DST_X + DST_WIDTH for left-to-right destination trajectories, or 
DST_X = original_DST_X - DST_WIDTH for right-to-left destination 
trajectories; otherwise, it is set to the original DST_X value before the draw 
occurred. Similarly for the DST_Y_TILE bit (with DST_Y and DST_HEIGHT). 


° For lines, the final DST_X, DST_Y rests on the last pixel of the line. The 
LAST_PEL_ON bit specifies whether the last pixel on that line is drawn. 
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SOURCE AND DESTINATION ALIGNMENT 


Sources may have one of two possible alignments: 
* Source alignment 


* Destination alignment. 


Source alignment means that the top left corner of the source area is aligned to the 
top left corner of the destination area, as illustrated below: 


Destination Area 


Source Alignment 


VA First pixel 
Blit 


Source Area > 


Destination alignment to the Nth pixel means that the top left corner of the source 
area is aligned to (X mod N)==0 and (Y mod N)==0: 


Destination Alignment Destination Area 


VA First pixel 
Source Area > 


-| <— CY mod N)== 


‘ 
' 


<— (Y mod N)==0 


t 
' 
1 1 
Ser eS Ms Se ee 
t 
Il 
' 


(X mod N)== (X mod N)== 
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SOURCE AND DESTINATION ALIGNMENT 


Various sources and their implicit alignments are listed below: 


DP_FRGD_CLR Destination aligned 
DP_BKGD_CLR Destination aligned 

Fixed 8x8 mono pattern Destination aligned X8 Y8 
Fixed 4x2 color pattern Destination aligned X4 Y2 
Fixed 8x1 color pattern Destination aligned X8 
Mono host Source aligned 

Color host Source aligned 


Any blit source (strictly linear, | Source aligned 
unbounded Y, general pattern, 
general pattern with rotation) 


The strict definition of source alignment is thata QWORD (or DWORD, depending 
on memory type and size) for a source-aligned source is rotated to align with the 
destination. No rotation occurs for destination-aligned sources. 
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SOURCE AND DESTINATION MIXING LOGIC 


The available mix functions and compare functions are listed in the tables below. 
The ALU will mix the source and destination data to any of the functions listed. 
More complex functions may be accomplished with multiple passes. 


The comparison functions compare a color register against either the destination 
data at the current pixel, or the color source data before processing by the ALU. 


¢ If the result of the comparison is FALSE, the result of the ALU is written to the 
destination; otherwise, the destination data is written to the destination. 


0 [Fase 
Pe i eet 
Pixel == CLR_CMP_COLOR 

| 6 [Reserved 


Function 0x17 additionally requires the DP_CHAIN_MASK register to be set. Each 
‘1’ in the mask will prevent the carry bit from that bit position from adding to the 
next bit. The register should be set according to the following table: 


PRG888GX0-01 © 1993 ATI Technologies Inc. 
2-46 . Proprietary and Confidential 


DRAW ENGINE CONTEXTS 


Chained 
contexts 


DRAW ENGINE CONTEXTS 


The mach64 can save its complete draw engine state to memory. A saved state in 
memory is a context. More than one context can be saved in memory — in fact, the 
number of contexts that can be saved is limited only by the amount of memory on the 
graphics card. Contexts can later be loaded using a single draw engine command. 


The load can be of two types: 
° Load the context. 


* Load the context, and initiate a draw operation. 


Contexts may also be chained together. When a chained context is loaded, each 
individual context is loaded and initiated in turn. A chained context therefore acts 


like a single, atomic draw operation. 


Contexts may be defined and restored from anywhere in screen memory, on any 64 
DWORD boundary. The context structure is shown in the following table: 


A] SRC_HEIGHTT_WioTHT | 
C___| SkC_HEIGHT2_wioTHa__| 
cE _ [PAT REGIS” 
fs, SekeaRemne 
10__| Sc.10P_BOTTOM 
1¢____| CONTEXT_LOAD_CNTE | 
1DaF | Resened 


0 
13 
14 
15 
16 
17 
18 

A 


1 
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DRAW ENGINE CONTEXTS 


Notes: 
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DST_BRES_LNTH is actually aliased to the DST_WIDTH register. 


The CONTEXT_MASK register does not affect the CONTEXT_MASK entry or 
the CONTEXT_LOAD_CNTL entry on loading. 


The CONTEXT_LOAD_CNTL must be set to no-op to stop it from chaining. 
Context save operators are not currently supported. 


Contexts are allocated in reverse order in screen memory. Lower screen memory 
addresses correspond to higher context numbers. For instance, context number 0 
resides at the top of screen memory minus 64 DWORDS, context 1 resides at top of 
memory minus 128 DWORDS, etc. 


Context loads can selectively mask out any DWORD. Bit 0 of the context mask 
register corresponds to DWORD 0 of the context, etc. Setting any bit to zero in the 
context mask will inhibit loading of the corresponding DWORD, except the mask 
bits for CONTEXT_MASK and CONTEXT_LOAD_CNTL are ignored and these 
registers are always loaded. The CONTEXT_MASK entry which is loaded controls 
the remaining DWORDs to be loaded. The CONTEXT_LOAD_CNTL entry must 
be set to no-op to halt a context chain. 


One possible context usage methodology is to set up various contexts for all 
possible “flavors” of draw operation at initialization time. It would be a simple 
matter to load a context, set up the trajectory registers, and initiate the draw 
operation. A load context simplifies the task of setting all the context registers 
manually. See Saving and Restoring a Context in Chapter 3,Simple Draw Operations. 


Contexts may also be used to improve concurrency (see Performance Issues in Chapter 
5, Advanced Topics II). If the draw engine is required to perform many small draw 
operations, these operation contexts may be written directly to screen memory and 
chained together. A single context load initiator is used to execute the entire chain. 

See Context Chains in Chapter 5, Advanced Topics I. 
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HARDWARE CURSOR 


Pseudo color 
modes and 
Direct color 
modes 


The mach64 hardware cursor is similar in function to the mach32 hardware cursor. 
Each cursor pixel is defined by a 2-bit field with the definition below: 


: ing 
| _00 _| Cursorcolor0 | 
| 01 {Cursor color 1 | 


Cursor pitch is always 64 pixels: that is, each scan line of the hardware cursor 
definition is defined with 64*2 bits (16 bytes) of data, regardless of the actual cursor 
width. The pixel definition is specified in Intel order: the first pixel is defined in 
the low-order 2 bits of the low-order byte in memory. Each cursor scan line 
definition resides back-to-back in memory. 


Cursor colors are defined by CUR_CLRO and CUR_CLRI1. Note that for pseudo color 
modes, the colors are specified in color indices, and for direct color modes, the colors 
are specified in 24-bit true color. The meaning of other registers is illustrated below: 


CUR_OFFSET 


Actual 
Displayed 


CUR_HORZ_OFFSET Casae 


64 lines 


CUR_VERT_OFFSET 


The screen position of the top left corner of the displayed cursor is specified by 
CUR_HORZ_VERT_POSN. Care must be taken when the cursor hot spot is not the 
top left corner and the physical cursor position becomes negative. The mach64 will not 
display the cursor at all if either the horizontal or vertical cursor position is negative. 


° If X becomes negative, the cursor manager must adjust the CUR_HORZ_OFFSET 
to a larger number and saturate CUR_HORZ_POSN to zero. 


¢ If Y becomes negative, CUR_VERT_OFFSET must be adjusted to a larger 
number, CUR_OFFSET must be adjusted to point to the appropriate line in the 
cursor definition, and CUR_VERT_POSN must be saturated to zero. 
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HARDWARE CURSOR 


Sample code 


This is sample code to setup a hardware cursor bitmap. 


CURSOR .DOC 


see CURSOR.C and HWCURSOR.C. 


/* 32x32 Cursor bitmap - 32 x 4 
unsigned int cursor32x32[128] = 


{ 


0x0000, 
0x0000, 
0x0000, 
0x0000, 
Oxff00, 
Oxff00, 
Oxff00, 
Oxf£f00, 
Oxff00, 
Oxff00, 
Oxff00, 
Oxff00, 
Oxff00, 
Oxff00, 
Oxff0o0, 
Oxff00, 
Oxff00, 
Oxff00, 
Oxf£00, 
Oxff00, 
Oxff00, 
Oxff00, 
Oxff00, 
Oxff00, 
Oxff£00, 
Oxff00, 
Oxff00, 
Oxff00, 
0x0000, 
0x0000, 
0x0000, 
0x0000, 


/* Main C program */ 


int main(void) 
{ 


0x0000, 
0x0000, 
0x0000, 
0x0000, 
Oxffff, 
Oxffff, 
Oxffff, 
Oxffff, 
0x5555, 
0x5555, 
0x5555, 
0x5555, 
Oxaa55, 
Oxaa55, 
Oxaa55, 
Oxaa55, 
Oxaa55, 
Oxaa55, 
Oxaa55, 
Oxaa55, 
0x5555, 
0x5555, 
0x5555, 
0x5555, 
Oxffff, 
Oxffff, 
Oxffff, 
Oxffff, 
0x0000, 
0x0000, 
0x0000, 
0x0000, 


words/line */ 


0x0000, 
0x0000, 
0x0000, 
0x0000, 
Oxffff, 
Oxffff, 
Oxffff, 
Oxffff, 
0x5555, 
0x5555, 
0x5555, 
0x5555, 
0x55aa, 
0x55aa, 
Ox55aa, 
Ox55aa, 
0x55aa, 
0x55aa, 
0x55aa, 
0x55aa, 
0x5555, 
ox5555, 
0x5555, 
0x5555, 
Oxffff, 
Oxffff, 
Oxfffrft, 
Oxffff, 
0x0000, 
0x0000, 
0x0000, 
0x0000, 


0x0000, 
0x0000, 
0x0000, 
0x0000, 
Ox00ff, 
Ox00ff, 
Ox00ff, 
Ox00ff, 
Ox00fft, 
Ox00ff, 
Ox00ff, 
OxOOfE, 
Ox00ff, 
Ox00ff, 
Ox00ff, 
Oxo00oft, 
Ox00ff, 
Oox00ff, 
Ox00f£, 
Ox00fft, 
Ox00ft, 
Ox00ff, 
Ox00ff, 
Ox00ff, 
Oxo0oftf, 
Ox00fE, 
Ox00ff, 
Ox00fEt, 
0x0000, 
0x0000, 
0x0000, 
0x0000 


For more details, 
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HARDWARE CURSOR 


The hardware cursor definition requires 2 bits per pixel. The cursor 
size regardless of the visible cursor size is 64 pixels x 64 lines. 
The total memory required to define the cursor is 16 bytes / line 
for 64 lines or 1024 bytes of data. The data must be in a contigious 
format. The 2 bit cursor code values are as follows: 


00 - pixel color = CURSOR_COLOR_0O 
O01 - pixel color = CURSOR_COLOR_1 
10 - pixel color = transparent (current display pixel) 
11 - pixel color = 1’s complement of current display pixel 
-~-- Use 16 bpp to draw cursor data information ---- 
By programming the starting address (‘offset’), all destination draw 


operations will start at this address. This is useful for setting up 
data that is resolution and color depth independent. 


The cursor data area will be first filled with ‘transparent code’ 
data. Cursor data that is transparent will not be visible when the 
hardware cursor is enabled. Secondly, the cursor pattern will be drawn 
to the appropriate area within the cursor data area. Note that the 
cursor data is in linearized format. It can be thought of as a 
rectangle whose lines are joined contigiously together in memory. 


setup a 32x32 hardware cursor - the cursor data will be visible 


set_hwcursor(modeinfo.yres - 10, 32, 32, 


// 


get_color_code(LIGHTBLUE) , // cursor color 0 
get_color_code (YELLOW), // cursor color 1 
cursor32x32); 


set hardware cursor to center of screen 


set_hwcursor_pos((modeinfo.xres / 2) - (32 / 2), 


// 


(modeinfo.yres / 2) - (32 / 2)); 


enable hardware cursor 


enable _hwcursor(); 


/* — om oe 


a ne oS oe oe Oe am om am Ge oe om Om oe oe Oe Oe oe Oe oe me oe oe ae oe oe oe SP oe oe SE oe oe om me oe oe oe oe > oe oe ow oe © © oe Om ee oe om oe oe OF oe oe oe oe OF oe me on om ow oe oe 


SET HWCURSOR - define hardware cursor bitmap 


This function sets up the hardware cursor data region according to the 
given bitmap data. The data region is located at (0, y). 
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HARDWARE CURSOR 


The hardware cursor be vary in size from 1x1 to 64x64. The expected bitmap 
format is LSB to MSB. The LSB will be drawn first in a left to right 
direction. Since the cursor position is NOT set in this routine, the 
position should be set (set_hwcursor_pos) prior to enabling the cursor. 


void set_hwcursor(int y, int width, int height, 
unsigned long color0, unsigned long asters, 
unsigned int *bitmap) 


unsigned long cur_offset, cur_size, cur_pitch; 
unsigned long temp1, temp2, temp3, temp4, temp5, temp6; 
unsigned long redl1, greenl, bluel, red2, green2, blue2; 
unsigned long rshift, gshift, bshift; 

unsigned long bitdata; 

int i, index, dataindex, start, widthwords; 

PALETTE entry; 


// Check that cursor size is within limits 
if ((width < 1) [| (width > 64)) return; 
1f ((height < 1) || (height > 64)) return; 


// set cursor dimensions 
cursordata.y = y; 
cursordata.width = width; 
cursordata.height height; 
cursordata.color0 color0; 
cursordata.colorl = colorl1; 


// set hwecursor bitmap to transparent 
for (index = 0; index < (HWCURHEIGHT * HWCURWIDTH); index++) 
{ 
cursordata.bitmap[index] = Oxaaaa; 
} 


// load user hwcursor data into bitmap 
dataindex = 0; 
widthwords = width / 8; 
1£ (width > widthwords * 8) 
{ 
widthwords++; 
} 
start = HWCURWIDTH - widthwords; 
for (index = start; index < (HWCURWIDTH * height); index = index + HWCURWIDTH) 


cursordata.bitmap[index + i] = *(bitmap + dataindex) ; 
1f (width < 8) 
{ 
cursordata.bitmap [index + i] = cursordata.bitmap[index 4+ i] << 
((8 - width) * 2); 
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HARDWARE CURSOR 


} 
dataindex++; 
1++; 
} while (1 < widthwords) ; 
} 


// calculate offset in bytes 
cur_offset = (unsigned long) (modeinfo.pitch) ; 
if (modeinfo.bpp == 4) 
{ 
cur_offset = (unsigned long) (cur_offset / 2); 
} 
else 
{ 


cur_offset (unsigned long) (cur_offset * (modeinfo.bpp / 8)); 


} 
cur_offset = (unsigned long) (cur_offset * y); 


// calculate cursor pitch (assuming 16 bpp) 
cur_pitch = HWCURWIDTH / 8; 
cur_pitch = cur_pitch << 22; 


// convert byte offset to qword offset and limit to 20 bits (used DWORD 
// format used in DST_OFF_PITCH 
cur_offset = (cur_offset / 8) & Ox000fffff; 


// Use 16 bpp to setup hardware cursor bitmap 


// save vital registers 
wailt_for_idle(); 

templ = regr(DP_PIX_WIDTH) ; 
temp2 = regr(DP_CHAIN MASK); 
temp3 = regr(DST_OFF_PITCH); 
temp4 = regr(DP_SRC); 

temp5 = regr(DP_MIX); 

temp6 = regr(DST_CNTL) ; 


// load bitmap data to hardware cursor bitmap data area 

regw(DP_PIX WIDTH, (temp1 & Oxffc00000) | BYTE_ORDER_LSB_TO_MSB | 
HOST_16BPP | SRC_16BPP | DST_16BPP); 

regw(DP_CHAIN MASK, 0x0410); // chain mask for 16 bpp 

regw(DST_OFF_PITCH, cur_pitch | cur_offset); 

regw(DP_SRC, FRGD_SRC_HOST); 

regw(DP_MIX, FRGD_MIX_S | BKGD_MIX D); 

regw(DST_CNTL, DST_Y_TOP_TO BOTTOM | DST_X_ LEFT TO RIGHT); 

regw(DST_X, 0); 

regw(DST_Y, 0); 

regw(DST_HEIGHT, HWCURHEIGHT) ; 

regw(DST_WIDTH, HWCURWIDTH) ; 

for (index = 0; index < (HWCURHEIGHT * HWCURWIDTH * 2); index = index + 2) 

{ 

wait_for_fifo(1); 
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bitdata = (unsigned long) (cursordata.bitmap [index+1] ) ; 
bitdata = (bitdata << 16) | (cursordata.bitmap [index] ); 
regw(HOST DATAO, bitdata) ; 


} 
wait_for_ idle(); 


// set cursor size offsets 

cur_size = (unsigned long) (64 - height); 

cur_size = (unsigned long) ((cur_size << 16) | (64 - width)); 
regw(CUR_HORZ _VERT_OFF, cur_size); 


// set cursor colors 
if (modeinfo.bpp > 8) 


{ 
// for 15, 16, 24, 32 color modes 
switch (modeinfo.bpp) 
{ 
case 16: 
if (modeinfo.depth == 555) 
{ 
rshift = 3; 
gshift = 3; 
bshift = 3; 
} 
else // 565 weight 
{ 
rshift = ? 
gshift = 2; 
bshift = 3; 
} 
break; 
case 24: 
case 32: 
rshift = 0; 
gshift = 0; 
bshift = 0; 
break; 
} 
// cursor color 0 
redl = get_primary_color(RED, color0) << rshift; 
greenl = get_primary_color(GREEN, color0O) << gshift; 
bluel = get_primary_color(BLUE, color0) << bshift; 
// cursor color 1 
red2 = get_primary_color(RED, colorl) << rshift; 
green2 = get_primary_color(GREEN, colorl) << gshift; 
blue2 = get_primary_color(BLUE, colorl1) << bshift; 
if (querydata.dac_type != DAC_ATI68860) 
{ 
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// standard setup for other dacs 
regw(CUR_CLRO, (red1 << 24) | (greenil << 16) | (bluel << 8)); 
regw(CUR_CLR1, (red2 << 24) | (green2 << 16) | (blue2 << 8)); 


} 
else 
{ 
// for 4, 8 bpp color modes 
if (querydata.dac_type == DAC_ATI68860) 
{ 
entry = get_palette((int) (color0O & Oxff)); 
redl = (unsigned long) (entry.red); 
greenl = (unsigned long) (entry.green) ; 
bluel = (unsigned long) (entry.blue) ; 
entry = get_palette((int) (colorl & Oxff)); 
red2 = (unsigned long) (entry.red); 
green2 = (unsigned long) (entry.green) ; 
blue2 = (unsigned long) (entry.blue) ; 
} 
else 
{ 
regw(CUR_CLRO, color0O & Oxff); 
regw(CUR_CLR1, colorl & Oxff); 


} 


if (querydata.dac_type == DAC_ATI68860) 

{ 
// special setup for ATI68860/880 dac for cursor colors 
outp (LioDAC_CNTL, 1); 
outp(LoODAC_REGS, 0); 


outp(LoODAC_REGS+1, (int) red1); 
outp(LoDAC_REGS+1, (int) greenl1); 
outp (LoDAC_REGS+1, (int)bluel); 


outp (LoDAC_REGS+1, (int) red2); 
outp(LoODAC_REGS+1, (int) green2) ; 
outp (LoDAC_REGS+1, (int)blue2); 


outp(LoDAC_CNTL, 0); 
} 


// set offset to cursor data region 
regw(CUR_OFFSET, cur_offset) ; 


// restore vital registers 
regw(DP_PIX WIDTH, temp1); 
regw(DP_CHAIN_ MASK, temp2) ; 
regw(DST_OFF_PITCH, temp3) ; 
regw(DP_SRC, temp4); 
regw(DP_MIX, temp5); 


© 1993 ATI Technologies Inc. PRG888GX0-01 
Proprietary and Confidential 2-55 


HARDWARE CURSOR 


regw(DST_CNTL, temp6); 


} 
/* ‘leds et. dah tt ik U6: eS. sme ty Se bina, Sc Gi. il tal‘ Su “in: ea msi gai lin eee iS Ses‘ ln”: Se Gl Set 
ENABLE_HWCURSOR - turn on the hardware cursor 
(seen geen‘ af thn: ne ‘oe! si,‘ ns pS: sew. “i em Gs: “i. NS Sc gh eS sm iin‘ eS Sn. eee sh */ 
void enable_hwcursor (void) 
{ 
// enable hardware cursor 
outpw(ioGEN_TEST_CNTL, inpw(1oGEN_TEST_CNTL) | 0x80); 
} 
/* eae 00” fem Gm cab Gre cane tate GD EAD CAD MED ND GED Wy GamE GEES SED GD GED GND ‘CU GUD GID GD (US GD Cam Ont) OED MimD cE GO OED OOD OOD GE ED OE GED SI OO OOD OE Ate OD OD OND OND MibD MD OD Mas Ma ED OD OD Om MD Om We win dee am Om OS OD mS we Oe oe oe oe 
SET_HWCURSOR_POS - set the hardware cursor position 
cme te ca com cm oe GD one OD nD OO Coe Oe cies mm Om cme mm cm OS mS OD mae Gem OD om ee ee oe ee ee eae ec ee ee oe ome me es ne oe ee ee ne Om om mee OS Om a a os */ 
void set_hwcursor_pos(int x, int y) 
{ 4 
unsigned long cur_pos; 
// check for coordinate violations 
1f (x < 0) x = O; 
1f (y < 0) y = O; 
// set cursor position 
cur_pos = (unsigned long)y; 
cur_pos = (unsigned long) ((cur_pos << 16) | x); 
regw(CUR_HORZ_VERT_POSN, cur_pos); 
} 
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DRAW OPERATIONS 


There are two draw operations available on the maché4: lines and rectangle fills 
(blits are a subset of rectangle fills). 


* Blits are initiated by writing to the DST_.WIDTH, DST_X_WIDTH, or 
DST_HEIGHT_WIDTH registers. 


¢ Lines are initiated by writing to the DST_BRES_LNTH register. Lines or blits 
may also be initiated by a context load command. 


Some possible programming operating styles are listed below: 


Style 1, Drawing Without Using Contexts 


1. Set up the draw engine context manually by writing to all the appropriate 
registers. A summary of these registers can be found in Appendix E, Register 
Summary. 


2. Set up the draw trajectory by writing to the trajectory control registers. 


Initiate the draw operation by writing to the appropriate draw initiator 
register. 


4. Repeat steps 1 to 3 for each draw operation. 


Sample code STYLE1.DOC 


This is sample code to show programming style 1. 


1. Setup draw engine context manually 

2. Setup draw trajectory of draw operation 

3. Initiate draw operation by writing to the appropriate register 
4. Repeat steps 1 to 3 for each operation 


See STYLE1.C and INIT.c for more details. 


// Step 1: Setup draw engine context manually -- and clear screen 
init _engine(); 
clear_screen(0, 0, modeinfo.xres, modeinfo.yres) ; 


// Step 2: Setup draw trajectory of draw operation - i.e. rectangle fill 


// araw a filled rectangle 

wait_for_fifo(5); 

regw(DP_FRGD_CLR, get_color_code(LIGHTGREEN) ) ; 
regw(DST_X, 0); 

regw(DST_Y, 0); 

regw(DST_HEIGHT, modeinfo.yres / 4); 


// Step 3: Initiate the draw operation - DST_WIDTH is a draw initiator 
regw(DST_WIDTH, modeinfo.xres / 4); 
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Style 2, Drawing Using Restored Contexts 


1. At initialization time, reserve some screen memory, and save all possible 
contexts to that area. Contexts may be saved by writing directly to screen 
memory. | 


Load desired default draw context. 
Set up trajectory registers. 


Initiate the draw operation. 


ee ee 


Repeat steps 2 to 4 for each draw operation. 


Sample code STYLE2.DOC 


This is sample code to show programming style 2. 


1. At initialization, store engine context(s) at appropriate addresses. 
2. Load desired default draw context. 

3. Setup draw trajectory registers. 

4. Initiate the draw operation. 

5. Repeat steps 2 to 4 for each operation 


See STYLE2.C for more details. 


unsigned long context [64]; // context array 


// Step 1: Store engine context at appropriate address to load later 


// determine top of memory address 
memcntl = inpw(ioMEM_CNTL) ; 
switch(memecntl & 7) 


{ 
case 0: contextaddr = 0x80000; break; // 512K 
case 1: contextaddr = 0x100000; break; // 1M 
case 2: contextaddr = 0x200000; break; // 2M 
case 3: contextaddr = 0x400000; break; // 4M 
case 4: contextaddr = 0x600000; break; // 6M 
case 5: contextaddr = 0x800000; break; // 8M 
} 
/ ® 
Fill context for context pointer 4. Each context pointer represents 
256 bytes of video memory. The context load address decreases as the 
context load pointer increases. Context pointer 0 points to the top of 
memory - 256. Some restrictions apply for context pointers 0-3 if the 
linear aperture size equals the memory size (i.e. 4M aperture size, 
4M of video memory). In this case, the memory mapped registers occupy 
1K of memory below the top of aperture. The only method to reach this 
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area in this case is to use the VGA paged aperture. 
*/ 
context [0] = Oxffffffff; 
context[1] = 0x00000000; 
context[{2] = (pitch / 8) << 22; // destination pitch 
context [3] = 0x00000000; 
context[4] = 0x00000000; 
context [5] = 0x00000000; 
context [6] = 0x00000000; 
context [7] = 0x00000000; 
context [8] (pitch / 8) << 22; // source pitch 
context[9] = 0x00000000; 
context[10] = 0x00000000; 
context[11] = 0x00000000; 
context [12] = 0x00000000; 
context[13] = 0x00000000; 
context[14] = 0x00000000; 


context [15] 
context [16] 
context [17] 
context [18] 
context [19] 


xres << 16; 
yres << 16; 


= 0x00000000; 
= Oxfffrrrtt; 


Oxfffrr¢rr¢rt; 


// scissors 


// set DP_PIX_WIDTH and CHAIN MASK according to color depth 
switch (modeinfo.bpp) 
{ 
case 4: 
context [20] 
context[21] = 
break; 
case 16: 
if (modeinfo.depth == 555) 
{ 


0x00008888; 


HOST_4BPP | SRC_4BPP | DST_4BPP; 


// 555 color weighting 
context [20] = 0x00004210; 
context [21] = HOST_1I5BPP | SRC_15BPP | DST_15BPP; 
} 

else 

{ 


// 565 color weighting 


context [20] = 0x00000410; 
context [21] = HOST_16BPP | SRC_16BPP | DST_16BPP; 
} 
break; 
case 32: 
context [20] = 0x00008080; 
context [21] = HOST _32BPP {| SRC_32BPP | DST_32BPP; 
break; 
default: 
case 8: 
case 24: 
context [20] = 0x00008080; 
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context[21] = HOST_S8BPP | SRC_S8BPP | 
break; 

} 

context [22] = FRGD_MIX S | BKGD_MIX_S; 


DST_8BPP; 


context [23] = FRGD_SRC_FRGD CLR; 

context [24] = 0x00000000; 

context [25] = OxfffffffFf; 

context [26] = 0x00000000; 

context [27] = DST_Y_TOP_TO_BOTTOM | DST_X LEFT TO_RIGHT; 
context [28] = 0x00000000; 

for (1 = 29; 1 < 64; 14+) 


{ 

context[i] = 0; 
} 
// Upload contexts 


/* 


The main focus here is to upload the context information for later 
retrieval by the engine context load feature. The method used will 
depend on the hardware setup - see STYLE2.C for more details. 


// Context load address calculation: 


// Address = total video memory - 


(context pointer + 1) * 0x100) 


// load up first context -> load pointer = 4 


upload_context (contextaddr - 


((4 + 1) * 0x100)); 


// Step 2: Load default context (load context pointer 4) 


// load all GUI registers 
walt_for_fifo(2); 
regw(CONTEXT_MASK, Oxffffffff); 
// load using context pointer 4 
regw(CONTEXT LOAD CNTL, 


// Step 3: Setup draw trajectory of draw operation - i.e. rectangle 


// set foreground color 
wailt_for_fifo(5); 


regw(DP_FRGD_CLR, 


// draw a filled rectangle 
regw(DST_X, modeinfo.xres / 32); 
regw(DST_Y, modeinfo.yres / 32); 


regw(DST_HEIGHT, modeinfo.yres / 4); 


CONTEXT_LOAD 


| 4); 


fill 


get_color_code(LIGHTCYAN) ) ; 


// Step 4: Initiate the draw operation - DST_WIDTH is a draw initiator 


regw(DST_WIDTH, modeinfo.xres / 2); 
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Style 3, Drawing Using Context Chains 


1. 


Sample code 


This is sample code to show programming style 3. 


DRAW OPERATIONS 


Configure all draw operations by writing context chains directly to screen 


memory. 


Do a load context with a draw initiate. 


Repeat steps 1 to 2 for each sequence of draw operations. 


STYLE3 . DOC 


1. Store pre-set engine draw context at appropriate address. 
2. Load pre-set engine draw context with a draw initiate command. 
3. Repeat steps 1 to 2 for each operation 


See STYLE2.3 for more details. 


unsigned long context [64]; 


// Step 1: Store engine context at appropriate 


// context array 


// determine top of memory address 
mementl = inpw(10oMEM _CNTL) ; 
switch(memcntl & 7) 


{ 
case 0: 
case 1: 
case 2: 
case 3: 
case 4: 
case 5: 


/* 


contextaddr = 


contextaddr 
contextaddr 
contextaddr 
contextaddr 


contextaddr = 


0x80000; break; 
0x100000; break; 
0x200000; break; 
0x400000; break; 
0x600000; break; 
0x800000; break; 


// 
// 
// 
// 
// 
// 


address to load later 


512K 
1M 
2M 
4M 
6M 
8M 


Fill context for context pointer 4. Each context pointer represents 
256 bytes of video memory. The context load address decreases as the 
context load pointer increases. Context pointer 0 points to the top of 


Some restrictions apply for context pointers 0-3 if the 


linear aperture size equals the memory size (i.e. 4M aperture size, 
4M of video memory). In this case, the memory mapped registers occupy 
1K of memory below the top of aperture. The only method to reach this 
this case is to use the VGA paged aperture. 


area in 
ay 
context [0] 
context [1] 
context [2] 


Oxffffrfrfef; 
0x00000000; 
(pitch / 8) 
temp = (unsigned long) (modeinfo.xres / 32); 
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context[3] = (temp << 16) | (modeinfo.yres / 32); // 
temp = (unsigned long) (modeinfo.xres / 8); 

context [4] = (temp << 16) | (modeinfo.yres / 4); // 
context [5] = 0x00000000; 

context[6] = 0x00000000; 

context [7] = 0x00000000; 

context[8] = (pitch / 8) << 22; // 
context [9] = 0x00000000; 

context[10] = 0x00000000; 

context [11] 0x00000000; 

context [12] = 0x00000000; 

context [13] = 0x00000000; 

context [14] 0x00000000; 

context[15] = xres << 16; // 
context[16] = yres << 16; 

context [17] = 0x00000000; 

context[18] = get_color_code(LIGHTBLUE) ; // 
context[19] = Oxffrrffrtf; 


(x, y) 


(width, height) 


source pitch 


scissors 


foreground color 


// set DP_PIX WIDTH and CHAIN_MASK according to color depth 
switch(modeinfo.bpp) 
{ 
case 4: 
context [20] = 0x00008888; 
context [21] = HOST _4BPP | SRC_4BPP | DST_4BPP; 
break; 
case 16: 
if (modeinfo.depth == 555) 
{ 
// 555 color weighting 
context [20] = 0x00004210; 
context [21] = HOST_15BPP | SRC_1I5BPP | DST_15BPP; 
} 
else 
{ 
// 565 color weighting 
context [20] = 0x00000410; 
context [21] = HOST_16BPP | SRC_16BPP | DST_16BPP; 
} 
break; 
case 32: 
context [20] = 0x00008080; 


context [21] = 


HOST_32BPP | SRC_32BPP | 


break; 


default: 
case 8: 
case 24: 


context [20] = 
context [21] = 


0x00008080; 
HOST_8BPP | SRC_8BPP | 


break; 


DST_8BPP; 


DST_32BPP; 
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context [22] = FRGD_MIX_S | BKGD_MIX_S; 
context [23] = FRGD_SRC_FRGD CLR; 
context [24] = 0x00000000; 
context [25] = Oxffffffff; 
context [26] = 0x00000000; 
context [27] = DST_Y_TOP_TO BOTTOM | DST_X LEFT TO RIGHT; 
context [28] = 0x00000000; 
for (1 = 29; i < 64; 14+) 
{ 
context[i] = 0; 


// Upload contexts 


The main focus here is to upload the context information for later 
retrieval by the engine context load feature. The method used will 
depend on the hardware setup - see STYLE2.C for more details. 

ba & 


// Context load address calculation: 

// 

// Address = total video memory - (context pointer + 1) * 0x100) 
// 


// load up first context -> load pointer = 4 
upload_context (contextaddr - ((4 + 1) * 0x100)); 


// Step 2: Load context with blit draw initiate (load context pointer 4) 
// load all GUI registers 

wait _for _fifo(2); 

regw(CONTEXT MASK, Oxfffffffl); 


// load & initiate using context pointer 4 
regw(CONTEXT LOAD CNTL, CONTEXT_LOAD AND DO FILL | 4); 
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REMARKS ON PIXEL DEPTH 


PRG888GXO0-01 
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Not all pixel depths are created equal: 
¢ 1 bpp mode is supported by the drawing engine, but not by the CRT 


controller. Therefore, 1 bpp mode can only be used in off-screen memory. 


* Pitch is normally specified in multiples of 8. An additional restriction is that it 


must also fall on a 64-bit boundary. That implies that pitch for Ibpp mode 
must be a multiple of 64, and pitch for 4bpp mode must be a multiple of 16. 


¢ The DP_BYTE_PIX_ORDER@DP_PIX_WIDTH bit only affects pixel ordering 


within a byte. Therefore, only lbpp and 4bpp modes are affected. 


¢ All pixel depths above 8bpp are direct color modes. 4bpp and 8bpp modes are 


pseudocolor modes. 


° Packed 24 bpp mode is actually 24bpp CRTC mode and 8bpp draw mode with 


special rotations done on DP_FRGD_CLR, DP_BKGD_CLR, 
DP_WRITE_MASK, and fixed 8x8 mono patterns. See Drawing in Packed 24 
Bit Per Pixel Modes in Chapter 4, Advanced Topics I. 


* DP_CHAIN_MASK must be manually set for the destination pixel depth (this 


register only affects the mix function 0x17, the averaging function). The 
following table lists the settings: 


N/A 


ee ee es 
4 0x8888 
0x8080 


8 (RGB 332) 0x9292 


15 (RGB 555 
16 (RGB 565 
packed 24 

z 


* 15bpp and 16bpp modes are identical draw modes, but different DAC modes 


must be set (use BIOS services for mode switching so the application doesn’t 
have to handle it). 15bpp mode is always RGB 555, and 16bpp mode is always 
RGB 565. 


¢ Although pixel depths for source area, destination area, and host may be set 


independently, the only pixel depth conversion available is 1bpp to any pixel 
depth monochrome expansion. Behavior is undefined for any other mixing 
and matching of pixel depths. 
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VGA INTERACTION 


Remember that physical memory is shared between the on-chip VGA and the 
accelerator. A logical boundary may be enabled with the MEM_CNTL register to 
inhibit the two logical devices from accessing the other’s memory. 


¢ When the memory boundary is disabled, each device has full access to on- 
board memory. 


¢ When the memory boundary is enabled, any memory accesses through the 
VGA aperture are inhibited. All draw engine functions that access the 
memory below the boundary are inhibited. The boundary may be set to Zero. 
Remember to set all draw engine offsets above the memory boundary. 


¢ Memory accesses through the big linear aperture are not affected by the 
memory boundary register. 


¢ If the application destroys VGA memory, the application must re-initialize the 


VGA mode before exiting. 
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SAVING AND RESTORING A CONTEXT 


Context pointers begin at the high address in screen memory and proceed 
downward. This facilitates any stack-based implementations of context saving and 
restoring. To calculate the byte address of a context load location: 


byte_address = ((memory_size / 256 - 1) - context_number) * 256 


The exact mapping of a context block is shown in Draw Engine Contexts in Chapter 2, 
Programming Model. Sample code is provided in Draw operations in Chapter 2, 
Programming Model. 


RECTANGLE FILL 


A rectangle fill is a draw operation that has a rectangular destination trajectory. 
Any source may be used. 


Sample code 


This is sample code to draw a filled rectangle. See RECT.C and DRAW.C for 
more details. 


// Draw a filled rectangle at (x, y) of size (width x height) using 
// the foreground color as the source. 


wait_for_fifo(7); 

regw(DP_FRGD_CLR, get_color_code(LIGHTBLUE)); // color depth independent 
regw(DP_SRC, FRGD_BLIT_SRC); 

regw(DST_CNTL, DST_Y_TOP_TO_BOTTOM | DST_X_LEFT TO RIGHT) ; 


regw(DST_X, x); 
regw(DST_Y, y); 
regw(DST_HEIGHT, height); 
regw(DST_WIDTH, width); 
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BITBLT 


A bitblt is a rectangle fill that specifically uses a color blit source. There are four 
types of blit source trajectory, as described in Trajectories in Chapter 2, Programming 


Cm Model. Note that the source trajectory direction always tracks the destination 


trajectory direction. Blit sources are always source-aligned. 


Simple One-To-One Bitblt 


Sample code SBLIT.DOC 


This is sample code to show a simple one-to-one blit. See SBLIT.C and DRAW.C 
for more details. 


// Draw some filled rectangles which will be used as the blit source at 
// (srex, srey) of size (srewidth, srcheight). These are resolution 
// independent. 


set_fg_color(get_color_code (WHITE) ); 
draw_rectangle(srcex, srcy, srewidth, srcheight) ; 


set_fg_color(get_color_code(LIGHTRED) ); 
draw_rectangle(srcx + (modeinfo.xres / 32), 
srey + (modeinfo.yres / 32), 
srewidth - (2 * (modeinfo.xres / 32)), 
ercheight - (2 * (modeinfo.yres / 32))); 


set_fg_color(get_color_code(LIGHTBLUE) ); 
draw_rectangle(srcex + (modeinfo.xres / 16), 
srcy + (modeinfo.yres / 16), 
srewidth - (2 * (modeinfo.xres / 16)), 
srcheight - (2 * (modeinfo.yres / 16))); 


// Perform a simple one-to-one color blit 
wait_for_fifo(11) ; 


regw(DP_SRC, FRGD_SRC_BLIT) ; 
regw(SRC_CNTL, 0); 
regw(DST_CNTL, DST_Y_TOP_TO_BOTTOM | DST_X_LEFT_TO_RIGHT) ; 


regw(SRC_X, srcx); 
regw(SRC_Y, srcy); 
regw(SRC_HEIGHT1, srcheight) ; 
regw(SRC_WIDTH1, srcwidth) ; 


regw(DST_X, dstx); 
regw(DST_Y, dsty); 

regw(DST HEIGHT, srcheight) ; 
regw(DST_WIDTH, srewidth) ; 
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General Pattern 


Sample code GPBLIT.DOC 


BITBLT 


This is sample code to show a general pattern blit. See GPBLIT.C for more 


details. 


// Setup dimensions of source and destination 


x1 = modeinfo.xres / 64; // source 
yl = modeinfo.yres / 48; 

widthl = modeinfo.xres / 16; 

height1 = modeinfo.yres / 12; 


x2 modeinfo.xres / 8; // destination 
y2 modeinfo.yres / 16; . 
width2 = modeinfo.xres / 4; 

height2 = modeinfo.yres / 3; 


// draw some filled rectangles which will be used as the blit source 


set_fg_color(get_color_code(WHITE) ); 
draw_rectangle(x1, yi, widthi, height1); 


set_fg_color(get_color_code(LIGHTBLUE) ); 
draw_rectangle(x1+4, y1+4, width1-8, height1-8); 


// Draw a general pattern blit: 
// 


// The general pattern blit allows the source to wrap in the X & Y 


// a@irections while continuing to draw to the destination 


// 


// set source to blit 
wait_for_fifo(11); 
.regw(DP_SRC, FRGD_SRC_BLIT); 


// use general pattern for source 
regw(SRC_CNTL, SRC_PATTERN_ENABLE) ; 
regw(DST_CNTL, DST_Y_TOP_TO BOTTOM | DST_X_ LEFT TO RIGHT); 


// perform general pattern blit: 
// 


// note that destination rectangle is larger than source rectangle 


// 


regw(SRC_X, x1); 
regw(SRC_Y, yl); 
regw(SRC_HEIGHT1, height1); 
regw(SRC_WIDTH1, width1) ; 


regw(DST_X, x2); 
regw(DST_Y, y2); 
regw(DST_HEIGHT, height2); 
regw(DST_WIDTH, width2); 
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General Pattern With Rotation 


Sample code GPRBLIT.DOC 


See source file GPRBLIT.C for more details. 


// Draw a patterned source made up from several filled rectangles 
draw_rectangle(10, 10, 40, 40, WHITE); 

draw_rectangle(14, 14, 32, 32, LIGHTBLUE) ; 

draw_rectangle(18, 18, 24, 24, LIGHTGREEN) ; 

draw_rectangle(22, 22, 16, 16, DARKRED); 


// draw a general pattern blit: 

// the general pattern blit allows the source to wrap in the X & Y 
// @drections while continuing to draw to the destination 
waitforfifo(15); // wait for 15 fifo entries 

// set source to blit 
regw(DP_SRC, FRGD_SRC_BLIT) ; 

// use general pattern with rotation for sre 
regw(SRC_CNTL, SRC_ROTATION_ENABLE | SRC_PATTERN_ENABLE) ; 
regw(SRC_X, 18); 
regw(SRC_Y, 18); 
regw(SRC_HEIGHT1, 24); 
regw(SRC_WIDTH1, 24); 
regw(SRC_X START, 10); 
regw(SRC_Y_ START, 10); 
regw(SRC_HEIGHT2, 40); 
regw(SRC_WIDTH2, 40); 

// wrap source 4 times in X & Y directions 

// with source pattern rotation 
regw(DST_CNTL, DST_Y_TOP_TO_ BOTTOM | DST_X_LEFT _TO_RIGHT); 
regw(DST_X, 100); 
regw(DST_Y, 50); 
regw(DST_HEIGHT, 160); 
regw(DST_WIDTH, 160); 
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Monochrome Expansion Bitblt 


Sample code MEBLIT.DOC 


This is sample code to show a monochrome to color expansion blit. See 
MEBLIT.C for more details. 


// Setup source and destination coordinates and sizes 
dstx = 0; 

dsty = 4; 

dstwidth = 16; 

dstheight = 20; 


srcwidth = dstwidth * dstheight; 
srcheight = 1; 


/* ---- Setup a monochrome source pattern ---- */ 


// This is done by drawing host data linearly in memory. The pattern will 
// be placed at (0, 0) so that it is visible on the screen. The linear 

// source data will be monochrome expanded into a 16x20 rectangle (320 

// bits). A “1” bit in the source data will be expanded to the foreground 
// color and a “0” will be expanded to the background color. The 

// resultant pattern will be a blue box with a white diamond and border. 


// monochrome host, src, dst 
wait_for _fifo(2); 
regw(DP_PIX WIDTH, HOST_1BPP | SRC_1IBPP | DST_1IBPP); 


// sre = host data 
regw(DP_SRC, FRGD_SRC_HOST) ; 


// overpaint mix for src, dst 
set_fg_mix(S_MIX); 
set_bg_mix(S_MIX) ; 


// setup source rectangle to be filled with monochrome host data 
wait_for_fifo(4); 

regw(DST_X, 0); 

regw(DST_Y, 0); 

regw(DST_HEIGHT, 1); 

regw(DST_WIDTH, 320); 


// copy host data to memory - pattern order is LSB to MSB 
wait_for_fifo(10); 

regw(HOST_DATAO, Ox8001FFFF) ; 

regw(HOST DATAO, 0x81818181); 
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regw(HOST_DATAO, 0x83C183C1); 
regw(HOST_DATAO, 0x87E187E1); 
regw(HOST_DATAO, Ox9FF98FF1) ; 
regw(HOST_DATAO, Ox8FF19FF9) ; 
regw(HOST_DATAO, 0x87E187E1) ; 
regw(HOST_DATAO, 0x83C183C1) ; 
regw(HOST_DATAO, 0x81818181); 
regw(HOST _DATAO, OxFFFF8001); 


// insure host transfer is done 
wait_for_idle(); 


// use 8 bpp for this example 
regw(DP_PIX_WIDTH, BYTE_ORDER_LSB_TO _MSB | 
HOST_1BPP | SRC_IBPP | DST_S8BPP); 


// blit host, sre = frgd 
regw(DP_SRC, MONO_SRC_BLIT | FRGD_SRC_FRGD_CLR) ; 


// foreground color = WHITE 
set_fg color(get_color_code(WHITE) ); 


// background color = LIGHTBLUE 
set_bg color(get_color_code(LIGHTBLUE) ) ; 


/* ---- Do monochrome expansion blit ---- */ 


// sre data is in linear format 
wait_for_fifo(10); 

regw(SRC_CNTL, SRC_LINEAR_ENABLE) ; 
regw(SRC_X, srcx); 

regw(SRC_Y, srcy); 
regw(SRC_HEIGHT1, srcheight); 
regw(SRC_WIDTH1, srcewidth) ; 


regw(DST_CNTL, DST_Y_TOP_TO_BOTTOM | DST_X_LEFT_TO_RIGHT) ; 
regw(DST_X, dstx); 

regw(DST_Y, dsty); 

regw(DST_HEIGHT, dstheight) ; 

regw(DST_WIDTH, dstwidth) ; 
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Line Patterns 


When the destination trajectory is a line, the source trajectory behaves in almost the 
same fashion as for a rectangular destination trajectory. The only differences are: 


¢ The source trajectory never advances in the Y direction (the source height is 
implicitly equal to one). 


* The source trajectory X direction is independent of the destination X direction, 
and is settable by SRC_LINE_X_DIR@SRC_CNTL. 


Sample code GPLINE .DOC 


This is sample code to show drawing a line using a pattern (general pattern 
line). See GPLINE.C for more details. 


// Setup line source, destination, and lengths 


// source line (a rectangle with a height of 1) at (srcex, srcy) 
srcex = 0; 

srey = 0; 

srec_line_length = modeinfo.xres / 20; 


// first line starting coordinate at (dstx1, dstyl) 
dstx1 = modeinfo.xres / 40; 
dastyl = modeinfo.yres / 30; 


// second line starting coordinate at (dstx2, dsty2) 
dstx2 = modeinfo.xres / 20; 
dsty2 = modeinfo.yres / 30; 


// destination lines length 
dst_line length = modeinfo.xres / 6; 


/* ---- Setup source line (rectangle with height of 1) ---- */ 


// Setup a color source pattern from host data. The number of host data 
// writes will depend on the current pixel depth. Also, the host data 
// must be packed into 32 bit pieces (1.e. for 8bpp modes, each host 

// write draws 4 pixels). 


// Setup a rectangle fill with host data 
wait_for_fifo(6); 

regw(DST_CNTL, DST_Y_TOP_TO BOTTOM | DST_X_LEFT_TO_RIGHT) ; 
regw(DP_SRC, FRGD_SRC_HOST) ; // sre = host data 
regw(DST_X, srcx); 

regw(DST_Y, srcy); 

regw(DST_HEIGHT, 1); 

regw(DST_WIDTH, src_line_length) ; 
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// Fill source rectangle (height of 1) with packed host data - this 
// function draws a line of source data of alterating colors - see 
// GPLINE.C for more information. 


host_fill(src_line_ length) ; 


// insure that host transfer is done 
wait_for_idle(); 


// Draw a diagonal line using data from the source. When the src data 
// runs out, the source pointer will wrap and repeat according to 

// the source x direction line bit in SRC_CNTL. Both directions are 
// demonstrated. 


/* ---- Draw line using LEFT TO RIGHT source x direction ---- */ 


wait_for_fifo(13); 

regw(DP_SRC, FRGD_SRC_BLIT) ; // sre = blit 
regw(SRC_CNTL, SRC_LINE_X LEFT TO RIGHT | SRC_PATTERN_ENABLE) ; 
regw(SRC_X, srcx); 

regw(SRC_Y, srcy); 

regw(SRC_HEIGHT1, 1); 

regw(SRC_WIDTH1, src_line_length) ; 


// draw a diagonal line using blit sre 

regw(DST_CNTL, DST_LAST_ PEL | DST_Y_TOP_TO_BOTTOM | DST_X_LEFT TO RIGHT); 
regw(DST_BRES_ERR, 0); 

regw(DST_BRES_INC, 1); 

regw(DST_BRES_ DEC, Ox3FFFF); 

regw(DST_ X, dstx1); 

regw(DST_Y, dsty1); 

regw(DST_BRES_ LNTH, dst_line_ length) ; // sre data will repeat 


/* ---- Draw line using RIGHT TO LEFT source x direction ---- */ 


wait for _fifo(6); 

regw(SRC_CNTL, SRC_LINE_X RIGHT TO LEFT | SRC_PATTERN_ENABLE) ; 
regw(SRC_X, src_line_length - 1); 

regw(SRC_Y, srcy); 


regw(DST_X, dstx2); 
regw(DST_Y, dsty2); 
regw(DST_BRES_LNTH, dst_line_ length); 
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FIXED PATTERNS 


Three types of fixed pattern are available: 
¢ 4x2 color pattern 
° 8x1 color pattern 
* 8x8 monochrome pattern. 


The fixed color patterns are only supported in 8bpp mode. Fixed patterns are 
always destination-aligned. See Pattern Consumption in Chapter 2, Programming 
Model for a depiction of pattern consumption. 


Sample code FIXPAT.DOC 


This is sample code to show the three fixed patterns available on the 
Mach 64. Note that the 4x2 and 8x1 pattern modes are only available in 
8 bpp accelerator modes. See FIXPAT.C for more details. 


/* ---- 8x8 mono pattern ---- */ 
wait_for_fifo(1l); 


// host = 8x8 pattern, sre = foreground clr 
regw(DP_SRC, MONO_SRC_PATTERN | FRGD_SRC_FRGD_CLR) ; 
regw(DP_MIX, FRGD_MIX S | BKGD_MIX_S); 


// enable 8x8 mono pattern 
regw(PAT_CNTL, PAT MONO 8x8_ENABLE) ; 
regw(PAT_REGO, Ox818181FF); 
regw(PAT REG1, OxFF818181); 


// f£rgd color used when pattern bit is “1” 
// bkgd color used when pattern bit is “0” 


regw(DP_FRGD_CLR, get_color_code(WHITE) ); 
regw(DP_BKGD_CLR, get_color_code(LIGHTBLUE) ); 


regw(DST_X, 0); 
regw(DST_Y, 0); 
regw(DST_HEIGHT, 16); 
regw(DST_WIDTH, 16); 


/* ---- 4x2 color pattern ---- */ 
wait_for_fifo(8); 


// sre = color pattern 
regw(DP_SRC, FRGD_SRC_PATTERN) ; 


// enable 4x2 color pattern 
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regw(PAT_CNTL, PAT _CLR_4x2_ ENABLE) ; 


// 0x09 - LIGHTBLUE, Ox0F - WHITE 
regw(PAT_REGO, Ox0F09090F) ; 
regw(PAT_REG1, Ox0FO9090F); 


regw(DST_X, 32); 
regw(DST_Y, 0); 
regw(DST_HEIGHT, 16); 
regw(DST_WIDTH, 32); 


/* ---- 8x1 color pattern ---- */ 
wait_for_fifo(7); 


// enable 8x1 color pattern 
regw(PAT_CNTL, PAT _CLR_8x1_ ENABLE) ; 


// 0x09 - LIGHTBLUE, Ox0F - WHITE 
regw(PAT REGO, Ox0909090F) ; 
regw(PAT_REG1, Ox0F090909); 


regw(DST_X, 80); 
regw(DST_Y, 0); 
regw(DST_HEIGHT, 16); 
regw(DST_ WIDTH, 32); 
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LINE DRAW 


Line draws are performed using an 18-bit Bresenham line draw engine. 


To draw a line: 
1. Set up the draw context with either a context load or many register writes. 


2. Determine the direction octant that the line trajectory will be drawn and set 
the DST_X_DIR, DST_Y_DIR and DST_Y_MAJOR bits accordingly. Also set 
the LAST_PEL_ON bit as desired (this bit only determines whether the last 
pixel in the line is drawn; it has no effect on the actual DST_X, DST_Y 
trajectory). 


3. From the start and endpoints of the line, calculate all the Bresenham 
parameters and write them out: 


DST_BRES_ERR = 2 * min(|dx|, |dyl) - max(|dax]1, |dyl) 
DST_BRES_INC = 2 * min(|dx!, |dyl) 
DST_BRES_DEC = 2 * [min(|dx|,Idyl) - max(|dx|, |dy!)] 


4. Write out the desired number of pixels drawn to DST_BRES_LNTH. 


Sample code 


This is sample code to draw a line. The engine does not support lines in 
24 bpp modes. See LINE.C and DRAW.C for more details. 


/* eee ae ee ee cme a wm tre Oe Oe a ee Oe mm Oe me Om oe 00 0s ee Maes OS OO we a Os a ee Om oe oom on ee ee ee oe ee 
DRAW_LINE - draw a line from (x1, yl) to (x2, y2) 


The drawing of the last pixel in the line is determined by the current 
setting of the DST _CNTL register (LAST_PEL bit). 


void draw_line(int x1, int yl, int x2, int y2) 
{ 

int dx, dy; 

int small, large; 

int x_dir, y_dir, y_major; 

unsigned long err, inc, dec, temp; 


/* determine x & y deltas and x & y direction bits */ 
Lf (xl < x2) 
{ 
ax = x2 - xi; 
x_dir = 1; 
} 
else 
{ 
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1£f (yl < y2) 


{ 
dy = y2 - yl; 
y_dir = 2; 

} 

else 

{ 
dy = yl - y2; 
y_dir = 0; 

} 


/* determine x & y min and max values; also determine y major bit */ 
1£ (dx < dy) 


{ 
small = dx; 
large = dy; 
y_major = 4; 

} 

else 

{ 
small = dy; 
large = dx; 
y_major = 0; 


} 


/* calculate bressenham parameters and draw line */ 
err = (unsigned long) ((2 * small) - large); 

inc (unsigned long) (2 * small); 

dec = Ox3fffFf - ((unsigned long)(2 * (large - small))); 


wait_for_idle(); // wait for idle before reading GUI registers 


// save used registers 
temp = regr(DST_CNTL) ; 


// draw bressenham line 
regw(DST_X, (unsigned long) x1); 
regw(DST_Y, (unsigned long)y1); 


// allow setting of last pel bit and polygon outline bit for line drawing 


regw(DST_CNTL, (temp & 0x60) | (unsigned long) (y_major | y_ dir | x_dir)); 
regw(DST_BRES_ERR, err); 

regw(DST_BRES_INC, inc); 

regw(DST_BRES_DEC, dec); 

regw(DST_BRES_LNTH, (unsigned long) (large + 1)); 


// restore 
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regw(DST_CNTL, temp); 


/* Main C program */ 


void main(void) 
{ 


/* ~--- Draw a line in each octant ---- */ 


// The line draw direction is determined in DST_CNTL. These bits are 

// determined in the draw_line() routine from the given start and end 
// point coordinates. Note that the engine does not support lines in 24 
// bpp. However, it is possible to use the engine to draw each 24 bpp 
// pixel of a line and use a line drawing algorithm such as bressenham 
// to determine where the pixels are drawn. The side effect is a great 
// loss in performance especially if patterned lines are considered. 


// source = foreground color 
wait_for_fifo(1); 
regw(DP_SRC, FRGD_SRC_FRGD_CLR) ; 


// foreground color = WHITE 
set_fg_color(get_color_code (WHITE) ); 


// draw a line 
draw_line(xl1, yl, x2, y2); 
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SCISSORING AND MASKING 


Drawing may be inhibited outside a rectangular region by setting the scissor 
registers. Scissors are inclusive on all edges. Drawing behavior is undefined for 
any objects drawn outside the device coordinate space, whether they are scissored 
or not. The device coordinate space is -4096 to +4095 in the X direction, and -16384 
to +16383 in the Y direction. 


Sample code = SCISSOR. DOC 


This is sample code to setup the engine scissors. See SCISSOR.C for more 
details. 


// Setup rectangle size 
dstx = 0; 

dsty = 0; 

dstwidth = 640; 
dstheight = 480; 


// Set the scissors to the size of the screen mode (inclusive) 
wait_for_fifo(4); 

regw(SC_LEFT, 0); 

regw(SC_TOP, 0); 

regw(SC_RIGHT, dstwidth - 1); 

regw(SC_BOTTOM, dstheight - 1); 


// Draw a WHITE filled rectangle 
set_fg_color(get_color_code (WHITE) ); 
draw_rectangle(dstx, dsty, dstwidth, dstheight); 


// Set the scissors to a slightly smaller size than the drawn rectangle 
wait_for_fifo(4); 

regw(SC_LEFT, 50); 

regw(SC_TOP, 50); 

regw(SC_RIGHT, dstwidth - 50); 

regw(SC_BOTTOM, dstheight - 50); 


// Draw a LIGHTRED filled rectangle of the same size as the previous one. 
// The outer border of the rectangle will be clipped by the scissors. 


set_fg_color(get_color_code(LIGHTRED) ); 
draw_rectangle(dstx, dsty, dstwidth, dstheight); 
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Bits within a particular pixel may be selectively inhibited by setting the 
DP_WRITE_MASK register 


Sample code §WMASK.DOC 


This is sample code to show the effects of the DP_WRITE MASK register. See 
WMASK.C for more details. 


// Set write mask to pass all bits to destination 
wait_for _ fifo(1); 
regw(DP_WRITE_MASK, Oxffffffff); 


// Draw a WHITE filled rectangle - resolution independent 
set_fg_color(get_color _code(WHITE) ); 
draw_rectangle(0, 0, modeinfo.xres / 2, modeinfo.yres / 2); 


// setup mask for WRITE_MASK register 

switch (modeinfo.bpp) 

{ 
case 4: mask OxfffffFCCIO; break; 
case 8: mask = Oxffffff00; break; 
case 16: mask = Oxffff0000; break; 
case 24: mask Oxff000000; break; 
case 32: mask = 0x00000000; break; 


} 


// Set write mask to mask off bits giving a result color of 
// LIGHTMAGENTA without changing the DP_FRGD CLR register value 


wait_for_fifo(1); 
regw(DP_WRITE MASK, mask | get_color_code(LIGHTMAGENTA) ); 


// Draw a filled rectangle with the new write mask setting 
set_fg_color(get_color_code(WHITE) ); 
draw_rectangle(modeinfo.xres / 2, 0, 

modeinfo.xres / 2, modeinfo.yres / 2); 
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SOURCE AND DESTINATION MIXING 


A source and destination pixel may be mixed in two ways: 


* A logical operation or an averaging function may be performed on the source 
and destination to produce a composite pixel. The process may be referred to 
as an ALU function, a mix function, or a ROP (raster operation). 


¢ The color source pixel (before ALU processing) or the destination pixel can be 
compared to a color compare register. If the result is FALSE, the result of the 
ALU is written; otherwise, the destination pixel is written back to the 
destination (no pixel is drawn). In this manner, the source pixel can be 
selectively inhibited from writing to the destination. 


ALU functions and compare functions may be used at the same time, but the ALU 
will only operate on pixels on which the compare function returns FALSE. All 
compare functions and ALU functions are listed in Source and Destination Mixing 
Logic in Chapter 2, Programming Model. 


Sample code = COMPARE.DOC 


This is sample code to show the effect of mixing and the comparing of the 
source with the destination. See COMPARE.C for more details. 


// 


// 


Fill the screen with two filled rectangles - BLUE on top, YELLOW below 


Set foreground mix to OVERPAINT 


set_fg mix(S_MIX); 


set_fg_color(get_color_code(LIGHTBLUE) ); 
draw_rectangle(0, 0, modeinfo.xres / 2, modeinfo.yres / 4); 


set_fg_color(get_color_code (YELLOW) ); 
draw_rectangle(0, modeinfo.yres / 4, 


// 


modeinfo.xres / 2, modeinfo.yres / 4); 


Draw a rectangle on top of the previously drawn one. The YELLOW 
rectangle on the screen is used as the source comparison. Since the 
compare color is YELLOW and the compare function is EQUAL, the 
resulting rectangle draw will not affect the YELLOW rectangle. The 
LIGHTBLUE rectangle, however, will be overwritten. The show mixing, 
the resulting rectangle (written on top of the LIGHTBLUE rectangle) 
will be LIGHTMAGENTA (LIGHTBLUE destination OR LIGHTRED source). 


set mixes 


set_fg mix(D_OR_S_ MIX); 
set_bg_mix(D_MTIX); 
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// compare color = YELLOW 
walt _for_fifo(3); 
regw(CLR_CMP_CLR, get_color_code (YELLOW) ); 


// allow all bits to pass in comparison 
regw(CLR_CMP_MASK, Oxffffrftfff); 


// compare source = destination 
regw(CLR_CMP_CNTL, COMPARE_EQUAL) ; 


// draw LIGHTRED rectangle over both LIGHTBLUE & YELLOW regions 
set_fg color(get_color_code(LIGHTRED) ); 
draw_rectangle(0, 0, modeinfo.xres / 2, modeinfo.yres / 2); 
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POLYGONS 


The maché4 uses an alternate-fill algorithm for polygon filling. Polygon fills are 
simply rectangle fills with the DST_POLYGON_ENA@DST_CNTL bit set. At the 
beginning of each destination scan line, an internal polygon fill flag is reset. 
Whenever this flag is ina reset state, drawing is inhibited. The polygon boundary 
source (this source is implicit, set it up using the blit source registers) is consumed, 
providing polygon boundary data. Whenever a polygon edge is detected, the 
internal polygon fill flag is toggled. Only rectangular destinations proceeding ina 
left-to-right and top-to-bottom direction are supported for polygon filling. 


Note that any monochrome or color sources may be selected in the pixel data path 
except for blit sources (because the blit source registers are used to configure the 
polygon source trajectory) when polygon filling. Polygon boundary source is only 
meaningful when configured to Ibpp pixel depth (set this with 
DP_SRC_PIX_WIDTH@DP_PIX_WIDTH). 


Polygon boundaries are created by drawing lines in Ibpp mode with the 
DST_POLYGON_ENA@DST_CNTL bit set. This bit causes a maximum of one pixel 
per scan line to be drawn (horizontal lines are not drawn at all), and lines exceeding 
the left scissor boundary are saturated to the left scissor. 
To draw a polygon: 

1. Set the destination pixel depth to Ibpp. 

2. Set the scissors to the outline drawing region. 

3. Clear the off-screen area where the polygon outlines are to be drawn. 

4 


Set the mix to XOR. (This takes care of the degenerate case where two polygon 
boundary lines culminate in a vertical peak). 


5. Set the DST_POLYGON_ENA bit. 
6. Draw all polygon outline lines from top to bottom with LAST_PEL_OFF. 
7. Set the scissors to the final destination area. 
8. Set all necessary destination context registers (pixel depth, mix, etc.). 
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9. Set up the blit source registers to point to the polygon outline area. 


10. Set up the blit destination registers to point to the final destination area (on- 
screen memory). 


11. Blit. 
Sample code POLYGON. DOC 


This is sample code to show how to setup and fill a polygon shape. Polygons 
are not supported in 24 bpp modes. 


/* Main C program */ 


int main(void) 

{ 
unsigned long offset, color _depth; 
int width, height; 
int box_x, box_y; 
int xl, yl, x2, y2, xtemp, ytemp; 
int di; 
POINT points [6]; 


// check if Mach6é4 adapter is installed 

if (detect_mach64() != YES_MACH64) 

{ 
printf (“Mach6é4 based adapter was not found.\n”); 
return (1); 


// £111 global query structure by calling Mach 64 ROM 
if (query_hardware() != NO_ERROR) 
printf(”“Failed ROM call to query Mach6é4 hardware.\n”); 
return (1); 
// set an accelerator mode 
1£ (open_mode(MODE_640x480, PITCH _XRES, COLOR_DEPTH_8) != NO_ERROR) 
printf(“Error in setting display mode.\n”); 
return (1); 
// Check for 24 bpp mode - Polygons are not supported in 24 bpp modes 


if (modeinfo.bpp == 24) 


// Adisable accelerator mode and switch back to VGA text mode 
close_mode(); 


printf(“Polygons are not supported in 24 bpp modes.\n”); 
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return (1); 


} 


// setup engine context and clear screen 


init_engine(); 


clear_screen(0, 0, modeinfo.xres, modeinfo.yres) ; 


POLYGONS 


// Polygon shape is a 5 sided pentagon - resolution independent 


points[0].x = 


points[0].y 
points[1].x 


points[1l].y = 


points[2].x 
points[2].y 
points[3].x 
points[3].y 
points[4].x 
points[4].y 
points[5].x 


points[5].y = 


modeinfo.xres / 2; 

modeinfo.yres / 12; 

(modeinfo.xres / 2) + (modeinfo.xres / 4); 
modeinfo.yres / 2; 

(modeinfo.xres / 2) + (modeinfo.xres / 8); 
modeinfo.yres - (modeinfo.yres / 12); 


(modeinfo.xres / 2) - (modeinfo.xres / 8); 
modeinfo.yres - (modeinfo.yres / 12); 
(modeinfo.xres / 2) - (modeinfo.xres / 4); 


modeinfo.yres / 2; 
points[0] .x; 
points[0].y; 


// determine width, height, x & y offsets 
width = points[1].x - points[4].x + 1; 
height = points[2].y - points[0].y + 1; 
box_x = points[4].x; 

box_y points[0].y; 


// Draw outline of shape to be filled to show the inclusivity of the 
// polygon fill. The bottom line is a horizontal line. Since polygon 
// outlines are drawn upto but not including the last pixel, this line 
// will remain. For the same reason, the top pixel of the shape will 
// remain. 


// Draw outline with regular lines in WHITE using the same top to bottom 
// drawing direction. 

set_fg_ color(get_color_code(WHITE) ); 

for (1 = 0; 1 < 5; i++) 


{ 
x1 = points[i] .x; 
yl = points[i] .y; 
x2 = points[i+1] .x; 
y2 = points[i+1] .y; 


// swap points if direction is not top to bottom 
if (yl > y2) 


ytemp = yl; 
yl = y2; 
y2 = ytemp; 
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x2 = xtemp; 
} 


draw_line(x1, yl, x2, y2); 
} 


// wait for a carriage return to continue 
getch(); 


/* ---- Draw filled polygon ---- */ 
// Calculate dword offset address of the start of off-screen memory 


offset = (unsigned long) (modeinfo.yres) ; 
offset = (unsigned long) (offset * modeinfo.pitch); 
if (modeinfo.bpp == 4) 
{ 
offset = (unsigned long) (offset / 2); 
} 
else 
{ 
offset = (unsigned long) (offset * (modeinfo.bpp / 8)); 
} 


// convert byte offset to dword offset 
offset = offset / 8; 


// 1. Set destination to 1 bpp for memory clearing and outline drawing 
wait_for_fifo(16); 
regw(DP_PIX_WIDTH, HOST_1BPP | SRC_IBPP | DST_1BPP); 


// 2. Set scissors to outline drawing region 
regw(SC_LEFT, 0); 

regw(SC_TOP, 0); 

regw(SC_RIGHT, width-1); 

regw(SC_BOTTOM, height-1); 


// 3. Clear off-screen memory where polygon outlines are to be drawn 


// insure engine idleness for GUI register reading 
wait_for_idle(); 


// set destination operations to off-screen memory 
regw(DST_OFF_ PITCH, regr(DST_OFF_PITCH) | offset); 


// clear memory 

regw(DP_SRC, FRGD_SRC_FRGD_CLR) ; 
regw(DP_MIX, FRGD_MIX_ZERO | BKGD_MIX_ ZERO); 
regw(DST_X, 0); 

regw(DST_Y, 0); 
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regw(DST_HEIGHT, height) ; 
regw(DST_WIDTH, width) ; 


// 4. Set mix to XOR 
regw(DP_MIX, FRGD_MIX_D XOR_S | BKGD_MIX ZERO); 
regw(DP_FRGD_CLR, 1); 


// 5. Set the DST_POLYGON_ENABLE bit, clear LAST_PEL bit 
regw(DST_CNTL, DST_POLYGON_ENABLE | 
DST_Y_TOP_TO_BOTTOM | DST_X_LEFT_TO_RIGHT) ; 


// 6. Draw all polygon outlines from top to bottom with LAST_PEL_OFF 
for (i = 0; i < 5; i++) 
{ 


x1 = points[{i].x - box_x; 
yl = points[i]l.y - box_y; 
x2 = points[{i+1].x - box_x; 
y2 = points[i+l].y - box_y; 


// swap points if direction is not top to bottom 
if (yl > y2) 


ytemp = yl; 
yl = y2; 
y2 = ytemp; 


xtemp = x1; 

x1 = x2; 

x2 = xtemp; 
} 


draw_line(xl, yl, x2, y2); 
} 


// 7. Set scissors to the final destination area 
wait_for_fifo(4); 

regw(SC_LEFT, 0); 

regw(SC_TOP, 0); 

regw(SC_RIGHT, modeinfo.xres-1); 

regw(SC_BOTTOM, modeinfo.yres-1) ; 


// 8. Set all necessary destination context registers 


// insure engine idleness for GUI register reading 
wait_for_idle(); 


// get source operations to off-screen memory 
regw(SRC_OFF_PITCH, regr(SRC_OFF_PITCH) | offset); 
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// set destination operations to on-screen memory 


regw(DST_OFF_PITCH, regr(DST_OFF_PITCH) & Oxffc00000); 


// set destination color depth to current mode 

switch (modeinfo.bpp) 

{ 
case 4: color_depth = DST_4BPP; break; 
case 8: color depth = DST_8BPP; break; 
case 16: 


if (modeinfo.depth == 555) // 555 color weighting 


{ 
color_depth = DST_15BPP; 


} 


else // 565 color weighting 


{ 


color_depth = DST_16BPP; 
} 
break; 
case 32: color_depth = DST_32BPP; break; 
} 


regw(DP_PIX WIDTH, HOST_1BPP | SRC_1IBPP | color_depth) ; 


// set desired mix and color values 
set_fg mix(S_MIX); 

set_bg_mix(D_MIX); 
set_fg_color(get_color_code(LIGHTRED) ); 


// 9. Setup blit source registers to point to polygon outline area 


wait_for fifo(8s); 
regw(SRC_X, 0); 
regw(SRC_Y, 0); 
regw(SRC_HEIGHT1, height); 
regw(SRC_WIDTH1, width) ; 


// 10. Setup blit destination registers to point to final destination area 


regw(DST_X, box_x); 
regw(DST_Y, box_y); 
regw(DST_HEIGHT, height) ; 


// 11. Blit 
regw(DST_WIDTH, width); 


// wait for a carriage return 
getch(); 


// Aisable accelerator mode and switch back to VGA text mode 


close_mode(); 


return (0); 
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DRAWING IN PACKED 24 BIT PER PIXEL MODE 


There is no 24-bit packed draw engine mode, but there is a 24-bit packed display 
mode. Drawing in this mode is accomplished by setting the engine in 8 bit per pixel 
mode and manipulating the DST_24_ROT and DST_24_ROT_ENA bits. The 
following rules must be followed for drawing in this mode: 


¢ Set source and destination pitches to three times the display pitch. 


* All X coordinates and widths must be specified at three times the normal 
value. Remember that left-to-right operations begin on an R value, and right- 
to-left operations begin on a B value. That means that for left-to-right 
operations, the initial DST_X is expressed as (X * 3) and for right-to-left DST_X 
is (X*3 +2). 


Before any draw operation is initiated, the DST_24_ROT_ENA@DST_CNTL 
must be enabled, and DST_24 ROT@DST_CNTL must be set to ((DST_X / 4) 
mod 6), where DST_X is the starting DST_X value as described above. 


In the above table: 


* Xis the desired X coordinate in packed 24bpp mode. 


es ¢ DST_X is the value that you actually write to the draw engine (remember to 
start on an R component on left-to-right operations, and on a B component for 
right-to-left operations). 


The DWORD and color components rows show how memory is actually laid 
out in relation to pixel data. 


The DST_24_ROT row shows the value to place in the 
DST_24_ROT@DST_CNTL field before initiating a draw operation. Use the 
leftmost DST_24_ROT number in the column for left-to-right operations, and 
the rightmost number for right-to-left operations. 


The DST_24_ROT value is simply the (DWORD-value-of-the-starting-byte 
mod 6). 


The rotation enable bit only affects DP_FRGD_CLR, DP_BKGD_CLR, 
DP_WRITE_MASK, and fixed 8x8 mono patterns. Colors and masks are 
rotated appropriately, keying on the DST_24_ROT value. 


¢ The line draw engine does not function in 24bpp packed mode. 
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SCROLLING AND PANNING 


Scrolling and panning of the display area to the limits of the draw area can be simply 
done by changing the value of CRTC_OFFSET@CRTC_OFF_PITCH. 


Ee Note that offset has a granularity of 64 bits, which means that horizontal panning 
will be more “jerky” at lower pixel depths than at higher pixel depths. 


Sample code PAN .DOC 


This is sample code to show panning of a 1024x1024 image in a 640x480 
viewable screen area. See PAN.C for more details. 


// Keyboard scan and ascii codes for demo 


#define ESC Oxlb // ascii code 
#define LEFT_ARROW O0x4b // scan code 
#define RIGHT ARROW 0x4d // scan code 
#define UP_ARROW 0x48 // scan code 
#define DOWN_ARROW 0x50 // scan code 


#define HOME 0x47 // scan code 
#define END Ox4f // scan code 
/* ttt rrcs rcs 


GET_KEY - get keyboard scan code using system ROM call 


Upper 8 bits = scan code 
Lower 8 bits = ascii code 


ee ee me ee coe ce ta cee mee se ee ee Se Oe SE SD OD OE Om OD OE Oe me Om wn MS Ow Ome OD ons O00 me mie es Gms es ew tats tne ee ie tm ce em ee ee em ee tee cae oe ee oe ee eo et ee * / 
int get_key(void) 
{ 
union REGS regs; 
regs.x.ax = 0x1000; 
int86(0x16, &regs, &regs); 
return (regs.x.ax); 
} 
/* Main C program */ 
int main(void) 
{ 
unsigned long offset; 
int color, ch; 
int xindex, yindex, xmax, ymax; . 
int max_x, max_y, size_x, size_y; 
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// check if Mach64 adapter is installed 
if (detect_mach64() != YES _MACH64) 


printf(“Mach64 based adapter was not found.\n”); 
return (1); 


// £111 global query structure by calling Mach 64 ROM 
if (query_hardware() != NO_ERROR) 


printf(”“Failed ROM call to query Maché4 hardware.\n”); 
return (1); 


// set an accelerator mode of 640x480 with a pitch of 1024 
if (open_mode(MODE_ 640x480, PITCH_1024, COLOR_DEPTH_8) != NO_ERROR) 


printf(“Error in setting display mode.\n”); 
return (1); 


/* 
To scroll and pan the display, the viewable display area must be 
smaller than the CRTC display area. In this case, the viewable area 
is 640x480 while the CRTC display area is 1024x1024. 

bad f 


// ‘desktop’ size = 1024 x 1024 
max_x = 1024; 
max_y = 1024; 


// ‘viewable screen size = 640 x 480 
size x = 640; 
size_y = 480; 


// setup engine context and clear screen 
init_engine(); 


// adjust engine scissors to desktop area (1024x1024) 
wait_for_fifo(4); 

regw(SC_LEFT, 0); 

regw(SC_TOP, 0); 

regw(SC_RIGHT, max_x-1); 

regw(SC_BOTTOM, max_y-1); 


// clear area 
clear _screen(0, 0, max_x, max_y); 


// Pill area with some full size rectangles (CRITIC width) 
for (color = DARKBLUE-1; color < WHITE; color++) 
{ 

set_fg_color(get_color_code(color + 1)); 
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draw_rectangle(color 


} 


color 
max _xX 
max_y 


* 30, 

* 30, 

- (color * 60) - 1, 
- (color * 60) - 1); 


// Scroll and pan around image: 


// 


// Viewable display is 640x480 


// Image size is 1024x 
// 

// ESC - exit 
// LEFT_ARROW - move 
// RIGHT_ARROW - move 
// UP_ARROW - move 
// DOWN_ARROW - move 
// HOME - - move 
// END - move 
// 

xindex = 0; 

yindex = 0; 

xmax = (max_x - size_x) 


ymax = (max_y - size_y) 
if (modeinfo.bpp == 4) 


amax 


xmax / 2; 


1024 


left 

right 

up 

down 

to top-left corner 

to bottom-right corner 


// 2 pixels per step 
// 2 lines per step 


~ 
DO Db 
me NO 


xmax = xmax * (modeinfo.bpp / 8); 


offset = 0; 
ch = 0; 

while ((ch & Oxff) 
{ 


!= ES 


C) 


// wait for key input 


while (kbhit() 


ch = get_key(); 


== 0) 


e 
a 


// position CRTC offset according to key input 


1£ ((ch & Oxff£) 


{ 


I= E 


switch (ch >> 8) 


{ 


SC) 


case LEFT_ARROW: // move left 2 pixels 
if (xindex > 0) 


{ 
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offs 
xind 


et = offset - 1; 
ex xindex - 4; 


(in 8 bpp) 
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} 
break; 


case RIGHT _ARROW: // move right 2 pixels 
if (xindex < xmax) 


{ 
offset = offset + 1; 
xindex = xindex + 4; 
} 
break; 
case UP_ARROW: // move up 2 lines 
if (yindex > 0) 
{ 
1£f (modeinfo.bpp == 4) 
{ 
offset = offset - (256 / 2); 
} 
else 
{ 
offset = offset - (256 * (modeinfo.bpp / 8)); 
} 
yindex = yindex - 1; 
} 
break; 


case DOWN_ARROW: // move down 2 lines 
if (yindex < ymax) 
{ 
1£ (modeinfo.bpp == 4) 
{ 


offset offset + (256 / 2); 


} 
else 
{ 
offset = offset + (256 * (modeinfo.bpp / 8)); 
} 
yindex = yindex + 1; 
} 
break; 


case HOME: // move to top-left corner (0, 0) 
offset = 0; 
xindex = 0; 
yindex = 0; 
break; 
case END: // move to bottom-right corner 


// calculate dword address of coordinate (1024-640,1024-480) 


offset = (unsigned long) (max_y - size_y); 
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(max_x - size_x))); 


} 
} 


} 


offset = (unsigned long) (offset * max_x); // pitch 
if (modeinfo.bpp == 4) 
{ 
offset = (unsigned long) (offset / 2); 
offset = (unsigned long) (offset + ((max_x - size_x) / 2)); 
} 
else 
{ 
offset = (unsigned long) (offset * (modeinfo.bpp / 8)); 
offset = (unsigned long) (offset + ((modeinfo.bpp / 8) * 


} 


offset = offset / 8; 
xindex = xmax; 
yindex = ymax; 
break; 


// vary CRTC offset while maintaining mode pitch 
-Low(LoCRTC_OFF_PITCH, (ior(ioCRTC_OFF_PITCH) & Oxffc00000)| offset) ; 


// Gisable accelerator mode and switch back to VGA text mode 


close_mode(); 


return (0); 


INTERRUPTS 
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The mach64 is able to generate hardware interrupts under a variety of conditions: 


* Interrupt on command FIFO overflow (BUS_CNTL) 

* Interrupt on host data error (BUS_CNTL) 

¢ Interrupt on CRTC vertical blank (CRTC_INT_CNTL) 

¢ Interrupt on CRTC vertical line count == CRTC_VLINE (CRTC_INT_CNTL) 


It is not recommended that interrupts be used in retail software applications, 
because ISA-based systems tend to be fully loaded with hardware-interruptable 
devices, and ISA interrupts are not shareable. 


See Single Buffering (Synchronized) on page 3-22 for an example of how interrupts 
can be used. 
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A transparent blit is simply a blit where a designated color (background color) 
from the source is inhibited from being drawn to the destination. This kind of blit 
is useful for copying odd-shaped objects onto a bitmapped background (games, for 
example). A simple blit with source compare enabled will do a transparent blit. 


Sample code TBLIT.DOC 


This is sample code to show a transparent blit. See TBLIT.C and DRAW.C for 
more details. 


// Draw a background with different colored rectangles - resolution 
// independent 
for (1 = BLACK; 1 <= WHITE; i++) 
{ 
set_fg_ color(get_color_code(i)); 
draw_rectangle(i * (modeinfo.xres 
i * (modeinfo.yres 
modeinfo.xres - (1 
modeinfo.yres - (i 


40), 

40), 

2 * (modeinfo.xres / 40)), 
2 * (modeinfo.yres / 40))); 


**eNNS 


// Draw the source to be blited 
set_fg_color(get_color_code(WHITE) ); 
draw_rectangle(0, 0, modeinfo.xres / 8, modeinfo.yres / 6); 


set_fg_ color(get_color_code(LIGHTRED) ); 
draw_rectangle(modeinfo.xres / 64, 
modeinfo.yres / 48, 
(modeinfo.xres / 8) - (2 * (modeinfo.xres / 64)), 
(modeinfo.yres / 6) - (2 * (modeinfo.yres / 48))); 


set_fg_color(get_color_code(LIGHTBLUE) ); 
draw_rectangle(modeinfo.xres / 32, 
modeinfo.yres / 24, 
(modeinfo.xres / 8) - (2 * (modeinfo.xres / 32)), 
(modeinfo.yres / 6) - (2 * (modeinfo.yres / 24))); 


// Set color compare registers to source compare: 


// In this example, the LIGHTBLUE center of the source rectangle 

// is selected as being the source color to ignore when bliting. 

// The resultant blit will contain all the source data except pixels 
// having the same color as the compare color (LIGHTBLUE). Note that 
// the compare source is SOURCE. Also, the compare color and the color 
// pixel being compared are both ANDed with the color compare mask. 
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wait_for_fifo(3); 

regw(CLR_CMP_CNTL, COMPARE SOURCE | COMPARE EQUAL); 
regw(CLR_CMP_ CLR, get_color_code(LIGHTBLUE) ) ; 
regw(CLR_CMP_MASK, Oxffffffff); 


// draw several transparent blits at different locations 


// set sre type for blit 
walt_for_fifo(1); 
regw(DP_SRC, FRGD_SRC_BLIT) ; 


for (1 = 0; 1 < 63; i++) 
{ 
b1it(0, 0, (modeinfo.xres / 8) + (i * (modeinfo.xres / 8)), 
i * (modeinfo.xres / 8), 
modeinfo.xres / 8, 
modeinfo.xres / 8); 


// By changing the compare function from COMPARE_EQUAL to 
// COMPARE _NOT_EQUAL, the resultant blit will contain ONLY the source 
// data having the same color as the compare color (LIGHTBLUE). 


wait_for_fifo(1); 
regw(CLR_CMP_CNTL, COMPARE SOURCE | COMPARE NOT _ EQUAL) ; 


// draw several transparent blits at different locations 

for (1 = O; 1 < 6; i++) 

{ 

bl1lit(0, 0, (2 * (modeinfo.xres / 8)) + (1 * (modeinfo.xres / 8)), 

1 * (modeinfo.xres / 8), 
modeinfo.xres / 8, 
modeinfo.xres / 8); 

} 


// wait for a carriage return 
getch(); 


// @Aisable accelerator mode and switch back to VGA text mode 
close_mode(); 


return (0); 
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For smooth animation, it is necessary to inhibit drawing to areas of the screen that 
are currently being scanned by the CRT controller. Failure to take necessary 
precautions will cause flickering or tearing effects on the animated object. Outlined 
below are several possible strategies that can be used for smooth animation. 


Double Buffering (Memory) 


Two areas of screen memory are allocated, each big enough for an entire display 
screen. While one memory area is being displayed, the other is updated, thus 
avoiding any collision between the CRTC and the draw engine. The system timer 
can be used to generate interrupts at constant time intervals. 


In the interrupt service routine for the system timer: 
1. Wait-for-idle to ensure that the draw engine is not in the middle of drawing. 


2. Set CRT_OFFSET to toggle to the memory area to display. Optionally, the 
application may wait for a vertical blank or a vertical line range before writing 
to CRT_OFFSET to prevent tearing between the two images. However, this 
would significantly degrade system performance, and the torn display 
disappears after 1/43rd to 1/76th of a second (depending on refresh rate) so 
it is unlikely to have any visible impact on animation smoothness. 


3. Set DST_OFFSET to enable writing to the non-displayed area. 
4. Signal the application program that the display offset has changed. 


In the mainline application: 


1. Determine which memory area is being displayed. Remember that the 
displayed and non-displayed areas are not identical (they are one frame apart) 
and the application must update the non-displayed area by two frames. 


Disable interrupts for critical draw operations. 


Update the non-displayed area with critical draw operations. A critical draw 
operation may be something like an undraw of an object and a redraw ata 
new location. The application does not want the CRT_OFFSET to change until 
the object is completely redrawn at the new location. 


4. Enable interrupts. 


The buffer switching may also be done in the main line application, and using the 
system timer to switch buffers is optional. The advantage to using the system timer 
is a constant frame rate. 
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Sample code SAMPLE .DOC 


This is sample code to show image drawing using double buffering. This 
example uses 8 targa files located in the IMAGE directory for its source 
images. See below for description. 


DBUF.C 


Example code to show double buffered drawing using the system timer. 
Several image “frames” are loaded into off-screen memory so that they can 
blited to the screen in sequence. Image tearing is avoided by using a 
double page buffering. The page being updated is never the page being 
displayed. In this example, the page buffer is controlled by waiting for a 
system timer tick which yields an approximate frame rate of 18. A page swap 
is done every timer tick. Double buffering does not require waiting for a 
specific vertical line or blanking before drawing. If the frame is more 
than approximately half the vertical sync frequency, image tearing will 
occur. In this case, it will be necessary to wait for a specific vertical 
line before drawing. 


Copyright (c) 1994 ATI Technologies Inc. All rights reserved 


#include <stdio.h> 

#include <stdlib.h> 
#include <string.h> 
#include <conio.h> 

#include <dos.h> 

#include “..\util\atim6é4.h” 
#include “..\util\sample.h” 
#include “..\util\vtga.h” 
#include “isr8.h” 


#define BACKGROUND_COLOR Ox1l1 


/ 


* Main C program */ 


int main(void) 


{ 


char filename[20]; 
TARGA_HEADER header; 
int width, height; 
int srcx, srcy; 

int savexl, saveyl; 
int savex2, savey2; 
int savex, savey; 
int i, Jj; 

int y_draw; 
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int frame; 

int frames; 

int framesperwidth; 
int topline; 

int y_update; 

int update_flag; 

int old_flag; 

int step; 

unsigned long offset; 
POINT points [8]; 


// check if Mach6é4 adapter is installed 

if (detect_mach64() != YES_MACH64) 

{ 
printf(“Mach64 based adapter was not found.\n”); 
return (1); 

} 


// £111 global query structure by calling Mach 64 ROM 

if (query_hardware() != NO_ERROR) 

{ 
printf(“Failed ROM call to query Mach6é4 hardware.\n”); 
return (1); 


} 


// check if Mach 64 VGA controller is enabled 
if (querydata.vga_type != VGA_ENABLE) 
{ 
printf(“This sample code example requires an enabled Mach 64 VGA 
controller. \n”); 
return (1); 
} 


// set an accelerator mode 
if (open_mode(MODE_ 640x480, PITCH _XRES, COLOR_DEPTH_8) != NO_ERROR) 
{ 

printf(“Error in setting display mode.\n”); 

return (1); 


} 


// setup engine context and clear screen 
init_engine(); 
clear_screen(0, 0, modeinfo.xres, modeinfo.yres) ; 


// get targa header information © 
if (get_targa_header(”..\\image\\framel.tga”, &header) != SUCCESS) 
{ 
close_mode(); 
printf(“Error reading targa file header information. \n”); 
return (1); 
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// setup image size, source area, and save area 
width = header.width; 

height = header.height; 

srcex = 0; 

srcy = (modeinfo.yres * 2) + height; 
savexl = 0; 

saveyl = modeinfo.yres * 2; 

sgavex2 = width; 

gavey2 = saveyl; 

y_draw = modeinfo.yres - height; 
step = 2; 


// inform ISR where second page begins (QWORD address) 


// calculate byte address 

offset = (unsigned long) (modeinfo.yres) ; 

offset = (unsigned long) (offset * modeinfo.pitch) ; 
if (modeinfo.bpp == 4) 


{ 
offset = (unsigned long) (offset / 2); 
} 
else 
{ 
offset = (unsigned long) (offset * (modeinfo.bpp / 8)); 
} 


// convert byte address to qword address 
offset = offset / 8; 


// add current pitch from CRTC_OFF_PITCH to variable 
offset = offset | (Lor(1loCRTC_OFF_PITCH) & Oxffc00000); 
setcrtcoffset (offset) ; 


// determine how large to expand the scissors 
frames = 8; 
framesperwidth = modeinfo.xres / width; 
topline = frames / framesperwidth; 
1f ((topline * framesperwidth) != frames) 
{ 
topline++; 
} 
topline = ((topline + 1) * height) + (modeinfo.yres * 2); 


// expand scissors 
wait_for_fifo(4); 

regw(SC_LEFT, 0); 

regw(SC_TOP, 0); 

regw(SC_RIGHT, modeinfo.xres - 1); 
regw(SC_BOTTOM, topline - 1); 


// setup background the same as the image background 
set_fg_color(BACKGROUND_COLOR) ; 


PRG888GX0-01 © 1993 ATI Technologies Inc. 
4-18 Proprietary and Confidential 


CRT SYNCHRONIZATION 


draw_rectangle(0, 0, modeinfo.xres, modeinfo.yres); 


// copy background (first buffer) to second buffer 
blit(0, 0, 0, modeinfo.yres, modeinfo.xres, modeinfo.yres) ; 


// load source images 

frame = 0; 

i= 0; 

j = 0; 

while (frame < frames) 

{ 
// record each frame coordinate 
points[frame].x = srcex + (width * 1); 
points[frame].y = srcy + (height * Jj); 


// load next frame image into video memory 
sprintf(filename, “..\\image\\frame%d.tga”, frame+1); 
1f (load_targa(filename, points[frame].x, points[frame].y) != SUCCESS) 
{ 
close_mode(); 
printf(“Error loading targa file to memory.\n”); 
return (1); 
} 


// adjust location of frame load coordinate as necessary 
frame++; 

1+4+; 

1f (1 > ((modeinfo.xres / header.width) - 1)) 


// set palette from targa color table (8 bpp) 
if (header.pixel_ depth == 8) 


{ 
if (set_targa_palette(”..\\image\\framel.tga”) != SUCCESS) 
{ 
close_mode(); 
printf(“Error reading targa file color table information. \n”); 
return (1); 
} 
} 


// wait for a key to start 
getch(); 


// setup for blits 
wait_for_fifo(3); 
regw(DP_SRC, FRGD_SRC_BLIT) ; 
regw(SRC_CNTL, 0); 
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regw(DST_CNTL, DST_Y_TOP_TO_BOTTOM | DST_X_LEFT_TO_ RIGHT); 


// copy starting postions to buffer save areas 
bl1it(0, y_draw, savexl1, saveyl1, width, height); 
blit(0, modeinfo.yres + y_ draw, savex2, savey2, width, height); 


// setup loop variables 

i = O; 

frame = 0; 

update_flag = getswapflag(); 
old_flag = update_flag; 


// enable timer ISR for page swaps each timer tick 
inittimerisr(); 


// main draw loop 
while (i < (modeinfo.xres-width) ) 


{ 
// syncronize frame updates with ISR 
while (update_flag == old_flag) 
{ 
update_flag = getswapflag(); 
} 
old_flag = update_flag; 
// don’t allow ISR to swap page while updating frame 
interrupts_off(); 
// set update and save variables according to active frame 
1£f (update_flag == 1) 
{ 
// display frame 1, update frame 2 
y_update = modeinfo.yres; 
gavex = savexX2; 
gsavey = savey2; 
} 
else 
{ 
// Gisplay frame 2, update frame 1 
y_update = 0; 
gsavex = savexl1; 
gsavey = saveyl; 
} 
// restore current frame from last frame (each page lags by 2 frames) 
if (i > step) 
{ 
blit(savex, savey, i - (2 * step), y_update + y_ draw, width, height) ; 
} 
// save current frame 
blit(i, y_update + y draw, savex, savey, width, height); 
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// update current frame 
blit (points[frame].x, points[frame].y, i, y_update + y_draw, width, height); 


// allow ISR to swap page 
interrupts_on(); 


// increment image position 
i =i + step; 


// determine next frame image 
frame++; 
if (frame >= frames) 
{ 
frame = 0; 
} 
} 


// disable timer ISR 
canceltimerisr(); 


// wait for a carriage return 
getch(); 


// @disable accelerator mode and switch back to VGA text mode 
close_mode(); 


return (0); 
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Double Buffering (Palette) 


The palette-driven double buffer is just a specialized case of the double buffer 
scheme described above. In 8bpp mode, two memory areas can be allocated, 
overlaid on top of each other, each 4bpp deep. The palette must be defined such 
that the lower four bits and the upper four bits specify the same 16 colors. The same 
algorithm is used as above, except that DAC_MASK is used to switch the displayed 
area, and DP_WRITE_MASK is used to write to the non-displayed area. 


Single Buffering (Synchronized) 


Simple animations (small update areas) may be accomplished with a single buffer 
with no flickering or tearing by refraining from drawing until the CRTC vertical 
line count is within a certain range. The vertical line count can be polled by reading 
CRTC_CRNT_VLINE@CRTC_VLINE_CRNT_VLINE or it can be interrupt-driven 
by setting CRTC_VLINE_INT_EN@CRTC_INT_CNTL and 
CRTC_VLINE@CRTC_VLINE_CRNT_VLINE. 


Ee Interrupts from the mach64 chip are not recommended because ISA systems cannot 
share interrupts, and commonly run out of IRQ levels. 


Sample code SBUFP .DOC 


This is sample code to show image drawing using vertical line polling. This 
example uses 8 targa files located in the IMAGE directory for its source 
images. See below for description. 


SBUFP.C 


Example code to show single buffered drawing by polling. Several image 
“frames” are loaded into off-screen memory so that they can blited to 

the screen in sequence. To prevent image tearing, a blit is not performed 
until the display reaches a specific vertical line (usually after the lower 
part of the blit). In this example, this is done by polling VLINE register. 
The frame rate is determined by the vertical frequency of the display 
unless the draw code can not keep up with the frame rate. 


Copyright (c) 1994 ATI Technologies Inc. All rights reserved 


Serr SSeS eee Sete See See SSS SSS SS SSS SESS SBE SSS SS SES SS SSS ESS SSS SSeS ss sss sss=s===* / 


#include <stdio.h> 

#include <stdlib.h> 
#include <string.h> 
#include <conio.h> 

#include <dos.h> 

#include “..\util\atim6é4.h” 
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#include “..\util\sample.h” 
#include “”..\util\vint.h” 
#include “”..\util\vtga.h” 


#define BACKGROUND_COLOR Ox11 


/* Main C program */ 


int main(void) 

{ 
char filename[20]; 
TARGA_HEADER header; 
int width, height; 
int srcx, srcy; 
int savex, savey; 
int i, Jj; 
int y_draw; 
int frame; 
int frames; 
int framesperwidth; 
int topline; 
int step; 
POINT points[8]; 


// check if Mach64 adapter is installed 

if (detect_mach64() != YES_MACH64) 

{ 
printf (“Mach64 based adapter was not found.\n”); 
return (1); 


// £111 global query structure by calling Mach 64 ROM 
if (query_hardware() != NO_ERROR) 


printf(”“Failed ROM call to query Mach6é4 hardware.\n”); 
return (1); 


// check if Mach 64 VGA controller is enabled 
if (querydata.vga_type != VGA_ENABLE) 


printf(“This sample code example requires an enabled Mach 64 VGA 
controller.\n”); 
return (1); 
} 


// set an accelerator mode 
if (open_mode(MODE_640x480, PITCH_XRES, COLOR_DEPTH_8) != NO_ERROR) 
{ 

printf(”“Error in setting display mode.\n”); 

return (1); 
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} 


// setup engine context and clear screen 
init_engine(); 
clear_screen(0, 0, modeinfo.xres, modeinfo.yres) ; 


// get targa header information 
1f (get_targa_header(”..\\image\\framel.tga”, &header) != SUCCESS) 
{ 
close_mode() ; 
printf(”“Error reading targa file header information. \n”); 
return (1); 
} 


// setup image size, source area, save area, and position increment 
width = header.width; 

height = header.height; 

srcox = 0; 

srcey = modeinfo.yres + height; 
savex = 0; 

savey = modeinfo.yres; 

y_draw = modeinfo.yres - height; 
step = 2; 


// determine how large to expand the scissors 
frames = 8; 
framesperwidth = modeinfo.xres / width; 
topline = frames / framesperwidth; 
if ((topline * framesperwidth) != frames) 
{ 
topline++; 
} 
topline = ((topline + 1) * height) + modeinfo.yres; 


// expand scissors to include source and save areas 
wait _for_fifo(4); 

regw(SC_LEFT, 0); 

regw(SC_TOP, 0); 

regw(SC_RIGHT, modeinfo.xres - 1); 

regw(SC_BOTTOM, topline - 1); 


// draw background the same as the image background 
set_fg_color(BACKGROUND_COLOR) ; 
draw_rectangle(0, 0, modeinfo.xres, modeinfo.yres) ; 


// load source images into off-screen memory for frame blitting and 
// record their position 

frame = 0; 

i = O; 

j = 0; 

while (frame < frames) 

{ 
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// record each frame coordinate 
points[frame].x = srcex + (width * 1); 
points[{frame].y = srcy + (height * 4); 


// load next frame image into video memory 
sprintf(filename, “..\\image\\frame%d.tga”, frame+1); 
if (load_targa(filename, points[frame].x, points[frame].y) != SUCCESS) 
{ 
close_mode(); 
printf (“Error loading targa file to memory.\n”); 
return (1); 
} 


// adjust location of frame load coordinate as necessary 
frame++; 

1++; 

if (1 > ((modeinfo.xres / header.width) - 1)) 


// set palette from targa color table (8 bpp) 
if (header.pixel_depth == 8) 
{ 
if (set_targa_palette(”..\\image\\framel.tga”) != SUCCESS) 
{ 
close_mode(); 
printf (“Error reading targa file color table information.\n”); 
return (1); 


} 


// wait for a key to start 
getch(); 


// setup engine for blits 

wait_for_fifo(3); 

regw(DP_SRC, FRGD_SRC_BLIT) ; 

regw(SRC_CNTL, 0); 

regw(DST_CNTL, DST_Y_TOP_TO_BOTTOM | DST_X LEFT _TO RIGHT); 


// set vline to wait (to prevent image tearing) and enable 
1f (y_draw < (modeinfo.yres - height) ) 


{ 
set_vline(y_draw + height); 
} 
else 
{ 
set_vline(modeinfo.yres) ; 
} 
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enable vlineint (); 


// main draw loop 

i= OO; 

frame = 0; 

while (1 < (modeinfo.xres - width) ) 

{ 
// @aisplay old frame, update new frame 
wait_for_vline(); // wait for vertical line 


// restore previous frame (if first frame has occurred already) 
1f (i > 0) 
{ 
blit(savex, savey, i-step, y_draw, width, height); 
} 


// save current frame 
blit(i, y_draw, savex, savey, width, height); 


// update current frame 
blit (points[frame].x, points[frame].y, i, y_draw, width, height); 


// eycle through frames 
1 =i + step; 
frame++; 
if (frame >= frames) 
{ 
frame = 0; 
} 
} 


// disable vline interrupts 
disable vlineint(); 


// wait for a carriage return 
getch(); 


// @aisable accelerator mode and switch back to VGA text mode 
close_mode(); 


return (0); 
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SBUFI.DOC 


This is sample code to show image drawing using vertical line interrupts. 
This example uses 8 targa files located in the IMAGE directory for its source 
images. See below for description. 


SBUFI.C 


Example code to show single buffered drawing using interrupts. Several 
image “frames” are loaded into off-screen memory so that they can blited to 
the screen in sequence. To prevent image tearing, a blit is not performed 
until the display reaches a specific vertical line (usually after the lower 
part of the blit). In this example, this is done by setting the VLINE 

and INT_CNTL registers to trigger an interrupt service routine (ISR). For 
the ISR to function, the Mach6é4’s IRQ2 jumper must be installed. The ISR 
may fail to function correctly if the IRQ channel is used by other hardware 
or is prevented to function. 


Note that the blit code is in the ISR instead of the main loop (as is the 
case for the polled example - SBUFP). 


The frame rate is determined by the vertical frequency of the display 
unless the draw code can not keep up with the frame rate. 


Copyright (c) 1994 ATI Technologies Inc. All rights reserved 


#include <stdio.h> 

#include <stdlib.h> 
#include <string.h> 
#include <conio.h> 

#include <dos.h> 

#include “..\util\atim64.h” 
#Hinclude “..\util\sample.h” 
#include “..\util\vint.h” 
#include “..\util\vtga.h” 
#include “visr.h” 


#define IRQ 2 
#define IRQ TIMEOUT 18 
#define BACKGROUND_COLOR 0x11 


/* Main C program */ 


int main(void) 

{ 
char filename[20]; 
TARGA HEADER header; 
int width, height; 
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int srcex, srcy; 

int savex, savey; 
int i, Jj; 

int y_draw; 

int update; 

int frame; 

int frames; 

int framesperwidth; 
int topline; 

int step; 

int oldcount; 
unsigned int starttick, endtick; 
POINT points[8]; 


// check if Mach6é4 adapter is installed 

if (detect_mach64() != YES_MACH64) 

{ 
printf(“Mach64 based adapter was not found.\n”); 
return (1); 


} 

// €ill global query structure by calling Mach 64 ROM 
1f (query_hardware() != NO_ERROR) 

{ 


printf(“Failed ROM call to query Mach6é4 hardware.\n”); 
return (1); 


// check if Mach 64 VGA controller is enabled 
if (querydata.vga_type != VGA_ENABLE) 


printf(“This sample code example requires an enabled Mach 64 VGA 
controller.\n”); 
return (1); 
} 


// set an accelerator mode 
1f (open_mode(MODE_ 640x480, PITCH_XRES, COLOR_DEPTH_8) != NO_ERROR) 
{ 
printf(”“Error in setting display mode.\n”); 
return (1); 
} 


// setup engine context and clear screen 
init_engine(); 
clear_screen(0, 0, modeinfo.xres, modeinfo.yres) ; 


// get targa header information 
if (get_targa_header(”..\\image\\framel.tga”, &header) != SUCCESS) 


{ 
close_mode(); 
printf(”“Error reading targa file header information. \n”); 
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return (1); 
} 


// setup image size, source area, save area, and position increment 
width = header.width & Oxfffe; 

height = header.height & Oxfffe; 

srcex = 0; 

srcy = modeinfo.yres + height; 

savex = 0; 

savey = modeinfo.yres; 

y_ draw = modeinfo.yres - height; 
step = 2; 


// setup blit dimensions for ISR 

setblitinfo(width, height, srcx, srcy, savex, savey, y_draw, step); 
setxstart (0); 

setxpos (0); 

setimage(0, srcx, srcy); 


// determine how large to expand the scissors 
frames = 8; 

framesperwidth = modeinfo.xres / width; 
topline = frames / framesperwidth; 


if ((topline * framesperwidth) != frames) 
{ 

topline++; 
} 


topline = ((topline + 1) * height) + modeinfo.yres; 


// expand scissors to include source and save areas 
wait _for fifo(4); 

regw(SC_LEFT, 0); 

regw(SC_TOP, 0); 

regw(SC_RIGHT, modeinfo.xres - 1); 

regw(SC_BOTTOM, topline ~- 1); 


// set background color the same as the image background 
set_fg_ color (BACKGROUND_COLOR) ; 
draw_rectangle(0, 0, modeinfo.xres, modeinfo.yres) ; 


// load source images into off-screen memory for frame blitting and 
// record their position 
frame = 0; 
i= 0; 
j = 0; 
while (frame < frames) 
{ 
// record each frame coordinate 
points[frame].x = srcex + (width * i); 
points[frame].y = srcey + (height * 4); 


// load next frame image into video memory 
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sprintf(filename, “..\\image\\frame%d.tga”, frame+1) ; 
1f (load_targa(filename, points[frame].x, points[frame].y) != SUCCESS) 
{ 
close_mode() ; 
printf(“Error loading targa file to memory.\n”); 
return (1); 
} 


// adjust location of frame load coordinate as necessary 
frame++; 

1++4+; 

1£ (1 > ((modeinfo.xres / header.width) - 1)) 


// set palette from targa color table (8 bpp) 
if (header.pixel_depth == 8) 
{ 
if (set_targa_palette(”..\\image\\framel.tga”) != SUCCESS) 
{ 
close_mode(); 
printf (“Error reading targa file color table information.\n”); 
return (1); 


} 


// wait for a key to start 
getch(); 


// setup engine for blits 

wait_for_fifo(3); 

regw(DP_SRC, FRGD_SRC_BLIT); 

regw(SRC_CNTL, 0); 

regw(DST_CNTL, DST_Y_TOP_TO BOTTOM | DST_X_LEFT_TO_RIGHT) ; 


// set vline to wait for (to prevent image tearing) and enable 
if (y_draw < (modeinfo.yres - height) ) 


{ 
set_vline(y_draw + height); 
} 
else 
{ 
set_vline(modeinfo.yres) ; 
} 


enable vilineint(); 


// enable ISR and start draw loop 
initvlineisr (IRQ); 
update = getcount(); 
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oldcount = update; 
i = 0; 
frame = 0; 
while (1 < (modeinfo.xres - width) ) 
{ 
// syncronize with ISR (vline trigger) - time out if no response 
starttick = *((unsigned int far *) (DOS_TICK_ADDRESS) ); 
endtick = starttick; 
while (update == oldcount) 


{ 
update = getcount(); 
endtick = *((unsigned int far *) (DOS_TICK_ADDRESS) ); 
1f (abs(endtick - starttick) > IRQ TIMEOUT) 
{ 
disable vlineint(); 
cancelvlineisr(); 
close_mode(); 
printf(“IRQ channel %d is not responding.\n”, IRQ); 
return (1); 
} 
} 


oldcount = update; 


// cycle through frames 
setimage(frame, points[frame].x, points[frame].y); 
frame++; 
if (frame >= frames) 
{ 
frame = 0; 
} 
// increment position 
i =i + step; 
setxpos(i); 
} 


// disable vline interrupts 
disable _vilineint (); 


cancelvlineisr(); 


// wait for a carriage return 
getch(); 


// @isable accelerator mode and switch back to VGA text mode 
close_mode(); 


return (0); 
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OldIntVector dw ? 
dw ? 
TRQnum ab 2 
TRQindex db Oah 
IRQmask1l db ? 
IRQmask2 db ? 
Count db 0 
Iwidth dw 0 
Height dw 0 
Ssrox dw 0 
SreY dw 0 
SavexX dw 0 
SaveYy dw 0 
Ydraw dw 0 
Step dw 0 
Xstart dw 0 
Xpos dw 0 
Image aw 0 
3 DO_BLIT 
; INPUT: WORD x1, 
: WORD yl, 
; WORD x2, 
; WORD y2, 
; WORD width, 
; WORD height 
3; OUTPUT: none 
; Assumes VGA aperture is enabled 


public do_blit 


ry 
, 


do_blit proc far 


3; create frame pointer 
push bp 
mov bp, sp 


3; Save used registers 
push bx 
push es 


3; Wait for fifo in FIFO_STAT (using VGA aperture) 


mov ax, 0b000h 
mov es, ax 
waitfifo: 
mov ax, es: [Of£10h) 
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or 
jnz 


ax, ax 
waitfifo 


3; Load up blit registers 


xor bx, bx 

3 SRC_X = x1 

mov ax, WORD PTR [bp+PARM] 
mov es: [(0fd84h], ax 

mov es: [O0fd86h], bx 

3 SRC_LY = yl 

mov ax, WORD PTR [bp+PARM+2] 
mov es: [(0fd88h], ax 

mov es: [0fd8ah], bx 


7 SRC_HEIGHT1 = height 
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mov ax, WORD PTR [bp+PARM+10] 
mov es: [0fd94h], ax 

mov es: [0fd96h], bx 

3; SRC_WIDTH1 = width 

mov ax, WORD PTR [bDp+PARM+8] 
mov es: [0fd90h], ax 

mov es: [0fd92h], bx 

3; DST_X = x2 

mov ax, WORD PTR [bp+PARM+4] 
mov es: [(0fd04h], ax 

mov es: [0fd06h], bx 

3; DST_Y = y2 

mov ax, WORD PTR [bp+PARM+6] 
mov es: [O0fd08h], ax 

Mov es: [O0fd0ah], bx 

7; DST_HEIGHT = height 


mov ax, WORD PTR [bp+PARM+10] 
mov es: [{0fd14h], ax 
mov es: [0fd16h], bx 


3; DST_WIDTH = width 


mov ax, WORD PTR [bp+PARM+8] 
mov es: [0fd10h], ax 
mov es: [0fd12h], bx 


+ restore saved registers 


pop es 
pop bx 


3; remove frame pointer 
mov sp, bp 
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pop bp 


do_blit endp 


“VlineISR - Handler for VLINE interrupt 


The Mach64 can have one of four IRQs set: 2, 3, 5, 10. It is assumed that 
the CRTC_VLINE_INT_EN bit is set in the CRTC_INT _CNTL register. 


3 eee ee ee we ee ee ee me ee ee ee ge we om om Gm eee me ete eee te Wee tte Gr cen Gm Oe OE UR ee OO DD De Gm a ee ee ae oe om ote ee ee Ow on Oe ee oe om oe ae ome om a ee oe oe oe ee ee 


public VlineISR 


e 
a 
° 
’ 
° 
f 
e 
t) 
° 
a 


VlineISR proc far 
eli 
push ax 
push dx 


3; Check if this interrupt trigger belongs to the VLINE interrupt 


mov ax, LoCRTC_INT CNTL 
in ax, dx 
and ax, 18h 
jnz acknowledge 
Imp skip 
acknowledge: 
3 Increment count 
inc Count 


3; Ack 8259 interrupt controllers according to the IRQ number 
cmp IRQnum, Oah 
je ack_2nd_ 8259 


3; ack master 8259 - required for IRQ 2, 3, 5 


mov dx, 20h 
mov al, 20h 
out dx, al 
cmp ITRQnum, 2 
jne ack_vline 


ack_2nd_8259: 
3; ack cascaded 8259 - required for IRQ 2, 10 


Mov ax, Oa0h 
mov al, 20h 
out ax, al 


ack_vline: 
3; ack vline interrupt bit on Maché4 
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no_restore: 


mov 
in 
or 
out 


3 worc- restore (if needed), 


7 don’t restore unless x position is more the zero 


cmp 


je 


ax, 
ax, 
ax, 
ax, 


LoCRTC_INT_CNTL 


ax 
10h 
ax 


Xpos, 0 
no_restore 


3; restore from last frame 


mov 
push 
mov 
push 
mov 
push 
mov 
sub 
push 
mov 
push 
mov 
push 
call 
pop 
pop 
pop 
pop 
pop 
pop 


ax, 
ax 
ax, 
ax 
ax, 
ax 
ax, 
ax, 
ax 
ax, 
ax 
ax, 
ax 
FAR 
ax 
ax 
ax 
ax 
ax 
ax 


3; save frame 


mov 
push 
mov 
push 
mov 
push 
mov 
push 
mov 
push 
mov 
push 
call 
pop 
pop 
pop 
pop 


ax, 
ax 
ax, 
ax 
ax, 
ax 
ax, 
ax 
ax, 
ax 
ax, 
ax 
FAR 
ax 
ax 
ax 
ax 


Height 
Iwidth 
Ydraw 


Xpos 
Step 


SaveyY 
Savex 


PTR do_blit 


Height 
Iwidth 
SaveY 
Savex 
Ydraw 
Xpos 


PTR do_blit 
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and update frame ---- 
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pop ax 
pop ax 


3; update current frame 


mov ax, Height 
push ax 
mov ax, Iwidth 
push ax 
mov ax, Ydraw 
push ax 
mov ax, Xpos 
push ax 
mov ax, SrcY 
push ax 
mov ax, Srox 
push ax 
call - FAR PTR do_blit 
pop ax 
pop ax 
pop ax 
pop ax 
pop ax 
pop ax 

skip: 
pop dx 
pop ax 
pushf 
call DWORD PTR OldintVector 
sti 
iret 


ViineISR endp 


GETCOUNT 
Retrieve the count value controlled by the ISR. 


Inputs: none 


a i, i ie i, Sn, nal | 


Outputs: WORD count 
public getcount 
getcount proc far 


7 get count 
mov al, Count 
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xor ah, ah 
ret 
getcount endp 
SETBLITINFO 


ry 
a 
° 
yf 
e 
a 
e 
f 
e 
f 
° 
f 
e 
as 
. 
rd 
° 
a 
° 
’ 
° 
r 
e 
ri 
. 
a’ 
° 
’ 
. 
Z 


Outputs: 


=e 


setblitinfo 


Inputs: WORD image width, 
WORD image height, 
WORD source xX, 
WORD source y, 
WORD save xX, 
WORD save y, 
WORD y draw, 
WORD step 


none 


public setblitinfo 


proc far 


Set blit dimension information for ISR. 


3; ¢reate frame pointer 


push bp 
mov bp, sp 


3; Save used registers 
push ax 


3; get image width 
mov ax, WORD PTR 
mov Iwidth, ax 


3; get image height 
mov ax, WORD PTR 
Mov Height, ax 


3 get source x 
mov ax, WORD PTR 
mov SrexX, ax 


3; get source y 
mov ax, WORD PTR 
mov SreY, ax 


3; get save x 
mov ax, WORD PTR 
mov Savex, ax 
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[bp+PARM] 


[bp+PARM+2] 


[bp+PARM+4] 


[bp+PARM+6] 


[bp+PARM+8] 
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3; get save y 
mov ax, WORD PTR [bp+PARM+10] 
mov SaveY, ax 


3; get y draw 


mov ax, WORD PTR [bp+PARM+12] 
mov Ydraw, ax 

3; get step 

mov ax, WORD PTR [bp+PARM+14] 
mov Step, ax 


3 restore saved registers 
pop ax 


3; remove frame pointer 


mov sp, bp 
pop bp 
ret 


setblitinfo endp 


SETXSTART 
Set image starting position for the ISR. 


Inputs: WORD x image position 


Le i a i ie iL] 


Outputs: none 


=e 


public setxstart 
setxstart proc far 

3; ¢reate frame pointer 

push bp 


mov bp, sp 


3; Save used registers 
push ax 


3; get image starting position (x) 
mov ax, WORD PTR [bp+PARM] 


mov Xstart, ax 


3; restore saved registers 
pop ax 


3; remove frame pointer 
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setxstart 


mov sp, bp 
pop bp 

ret 

endp 
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SETXPOS 


Inputs: 


Outputs: 


e 
a 


setxpos 


setxpos 


Inputs: 


e 
f 
e 
a’ 
e 
ra 
e 
f 
e 
a 
° 
ri 
e 
’ 
° 
’ 
Ld 
a’ 
e 
i 


3 

3 

3 

3; Set current image position for ISR. 
7 

; WORD x image position 

; 


public setxpos 

proc far 

3; create frame pointer 
push bp 


mov bp, sp 


3; Save used registers 
push ax 


3; get current image position (x) 
mov ax, WORD PTR [bp+PARM] 


mov Xpos, ax 


3 restore saved registers 
pop ax 


3; remove frame pointer 


mov sp, bp 
pop bp 

ret 

endp 


Set source image for ISR. 
WORD source image, 
WORD source x, 


WORD source y 


Outputs: none 


SETIMAGE 
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public setimage 

setimage proc far 
3 Create frame pointer 
push bp 
mov bp, sp 
3; Save used registers 
push ax 
3; get source image 
mov ax, WORD PTR [bp+PARM] 
mov Image, ax 
3; get image coordinates 
mov ax, WORD PTR [bp+PARM+2] 
mov Srcox, ax 
mov ax, WORD PTR [bp+PARM+4] 
mov SrcY, ax 
3; restore saved registers 
pop ax 
3 remove frame pointer 
mov sp, bp 
pop bp 
ret 

setimage endp 

GETIRQINDEX 


Inputs ;: 


me we Swe We we Se TWO Ne SO RO 


me 


Return the IRQ index given an IRQ number 


WORD IRQ num 
AX: valid values : 2 ,3 , 5 


Outputs: WORD IRQ index 
AX: valid returns : OAh, OBh, ODh, 


public getirgindex 


getirgqindex proc far 


OAh 


72h 


3; determine IRQ index based on IRQ number 


xor ah, ah 
cmp al, 2 
jne chkirg3 
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mov al, Oah 

jmp done 
chkirg3: 

cmp al, 3 

jne chkirg5 

mov al, Obh 

jmp done 
chkirg5: 

cmp al, 5 

jne chkirgl10 

mov al, Odh 

jmp done 
chkirq10: 

cmp al, Oah 

jne badirqnum 

mov al, 72h 

jmp done 
badirgqnum: 

mov al, 0 
done: 

ret 


getirqindex endp 


INITVLINEISR 


Inputs : WORD IRQnum 
AX: valid values: 2, 


e 
as 
. 
a 
° 
a 
° 
a 
e 
a 
e 
a 
° 
’ 
e 
f 
e 
4 


Outputs: none 


me 


public initvlineisr 
initvlineisr proc far 

3; create frame pointer 

push bp 


mov bp, sp 


3; save used registers 


push bx 
push ax 
push ds 


3, 


5, 


Chain in VLINE interrupt service routine. 


OAh 
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shiftleft: 


irgl10: 


push es 


7 get IRQ number 


mov ax, WORD PTR [bp+PARM] 
mov IRQnum, al 

Mcall getirqindex 

mov IRQindex, al 


3; save IRQ mask for master 8259 (IRQs 0 - 7) 


mov dx, 21h 
in al, dx 
mov TRQmaski, al 


3; save IRQ mask for cascaded 8259 (IRQs 8 - 15) 


mov ax, Oalh 
in al, dx 
mov - TROmask2, al 


3 load IRQ number and enable interrupts in appropriate 8259 
xor ch, ch 
mov cl, IRQnum 


1f IRQ 2, 3, 5 - program master 8259 
if IRQ 10 - program cascaded 8259 
bit = 0 - enable interrupt 

bit = 1 - disable interrupt 

cmp cl, Oah 

je irqi0d 


me te NO ft 


3; enable interrupt on master 8259 for IRQ 2, 3, 5 


mov dx, 2ih 

mov ah, Offh 
mov al, 1 

shl al, 1 

loop shiftleft 
sub ah, al 

mov al, IRQmask1 
and al, ah 

eli 3; Gisable interrupts during mask update 
out ax, al 

sti 

jmp continue 


enable interrupt on cascaded 8259 for IRQ 10 - IRQ 2 also needs 
to be enabled 


=e =e 


3; enable cascaded interrupt 2 on master 8259 


mov ah, Ofbh 
mov dx, O21h 3 IRQ 2 cascades to IRQ 10 
mov al, IRQmaskl 
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and al, ah 

eli 3; Gisable interrupts during mask update 
out dx, al 

sti 


3; enable interrupt 10 on cascaded 8259 


mov dx, Oalh 
mov al, IRQmask2 
and al, ah 
eli 3; disable interrupts during mask update 
out dx, al 
sti 
continue: 
7; initialize variables for ISR 
mov ax, 0 
mov Count, al 


3; get old IRQ vector address (in ES:BX) 


mov al, IRQindex 
mov ah, 35h 
int 21h 


3 Save vector address 


Mov ax, es 
mov OldIntVector, bx 
mov OldiIntVector+2, ax 


3; get address of interrupt service routine (in DS:DX) 


mov ax, offset cs:VlineISR 
mov ax, ¢s 
mov ds, ax 


3; set new vector address 


mov al, IRQindex 
mov ah, 25h 
int 21h 


3; restore registers 


pop es 
pop ds 
pop dx 
pop bx 


3; remove frame pointer 


mov sp, bp 
pop bp 
ret 


initvlineisr endp 
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me ™e Se Se MO he fe fe 


=e 


cancelvlineisr proc 


cancelvlineisr endp 


Ay en ee | ee) ee, | ee 


Inputs : none 


Outputs: none 


public 


Disable timer interrupt routine. 


cancelvlineisr 


far 


3; save used registers 


push 
push 


3; restore old IRQ vector address (in DS:DX) 


mov 
mov 
mov 


mov 
mov 
int 


ax 
ds 


dx, 
ax, 
ds, 


al, 
ah, 
21h 


OldiIntVector 
OldIntVector+2 
ax 


TRQindex 
25h 


CANCELVLINEISR 


3; restore IRQ masks - disable interrupts while updating masks 


cli 
mov 
mov 
out 
mov 
mov 
out 
sti 


ax, 
al, 
dx, 
dx, 
al, 
dx, 


21h 
IRQmask1 
al 

Oalh 
TRQOmask2 
al 


3 restore registers 


pop 
pop 


ret 


Inputs : none 


Outputs: none 


ds 
ax 


Disable system interrupts. 


INTERRUPTS_OFF 
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public interrupts_off 
interrupts_off proc far 


3; disable system interrupts 
eli 


ret 


interrupts_off endp 


INTERRUPTS_ON 
Enable system interrupts. 
Inputs : none 


° 
a 
e 
I 
° 
a 
° 
s 
e 
a 
° 
£ 
° 
s 
° 
a 


Outputs: none 


e 
# 


public interrupts_on 
interrupts_on proc far 


3; enable system interrupts 
sti 


ret 


interrupts_on endp 
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Single Buffering (Delta Framing) 


Delta framing is a method of achieving flicker-free animation without CRT 
synchronization. Only the changes from one frame to the next are drawn on the 
screen. The animation will be flicker-free because no undrawing is ever done. 
Tearing will occur, but the effects will be minimal, given the draw rate. 


Sample code SAMPLE . DOC 
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OFF-SCREEN MEMORY MANAGEMENT 


Off-screen memory management is a requirement for any real application that 
directly uses the accelerator. Hardware cursor definitions, context save areas, font 
caches, and bitmap caches are all kept in off-screen memory. Independent source 
and destination pitches and offsets, and a linear source trajectory facilitate 
implementation of an off-screen memory manager. 


Memory can be allocated in linear chunks, aligned to 64-bit boundaries. One 
possible implementation of an off-screen memory manager is show below: 


Sample code SAMPLE .DOC 
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CONTEXT CHAINS 


Context chains are used to group many draw operations into a single atomic 
operation. This is useful for improving concurrency on engine-bound operations 
(see Performance Issues in Chapter 5, Advanced Topics II). 


The CONTEXT_MASK eniry in the context load structure can be used to inhibit 
loading of any register so that chains may take advantage of draw side effects. 


Ez Note that on loading, the CONTEXT_MASK register has no effect on the 


CONTEXT_MASK entry or the CONTEXT_LOAD_CNTL entry in the context save 
structure. These two registers are always loaded. To halt a context chain, the 
CONTEXT_LOAD_CNTL entry must be set to do nothing. 


Sample code CHAIN. DOC 


This is sample code to demonstrate chained context operations. See CHAIN.C 
for more details. 


unsigned long context [64]; // context array 


/* Main C program */ 
int main(void) 


{ 


// get modal information for resolution independent operation 
xres = (unsigned long) (modeinfo.xres) ; 

yres = (unsigned long) (modeinfo.yres) ; 

pitch = (unsigned long) (modeinfo.pitch) ; 


// setup engine context and clear screen 
init_engine(); 
clear_screen(0, 0, modeinfo.xres, modeinfo.yres) ; 


// determine top of memory address 

mementl = inpw(ioMEM_CNTL) ; 

switch(memcntl & 7) 

{ 
case 0: contextaddr = 0x80000; break; // 512K 
case 1: contextaddr = 0x100000; break; // 1M 
case 2: contextaddr = 0x200000; break; // 2M 
case 3: contextaddr = 0x400000; break; // 4M 
case 4: contextaddr = 0x600000; break; // 6M 
case 5: contextaddr = 0x800000; break; // 8M 
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/* 
Fill context. Each context pointer represents 256 bytes of video 
memory. The context load address decreases as the context load pointer 
increases. Context pointer 0 points to the top of memory - 256. Some 
restrictions apply for context pointers 0-3 if the linear aperture 
size equals the memory size (1.e. 4M aperture size, 4M of video 
memory). In this case, the memory mapped registers occupy 1K of memory 
below the top of aperture. The only method to reach this area in this 
case is to use the VGA paged aperture. 

*/ 

context[0] = Oxffffffff; 

context[{1] = 0x00000000; 

context[2] = (pitch / 8) << 22; // destination pitch 

context[3] = 0x00200010; 

context[4] = 0x00400080; 

temp = (unsigned long) (modeinfo.xres / 20); 

context[3] = (temp << 16) | (modeinfo.yres / 30); // (x, y) 

temp = (unsigned long) (modeinfo.xres / 10); 

context [4] = (temp << 16) | (modeinfo.yres / 4); // (width, height) 

context[5] = 0x00000000; 

context[6] = 0x00000001; 

context[7] = Ox0003ffff; 

context[8] = (pitch / 8) << 22; // source pitch 

context[9] = 0x00000000; 

context[10] = 0x00000000; 

context[11] = 0x00000000; 

context[12] = 0x00000000; 

context [13] = 0x00000000; 

context[14] = 0x00000000; 

context[15] = xres << 16; // scissors 

context[16] = yres << 16; 

context[17] = 0x00000000; 

context[18] = get_color_code(YELLOW) ; // foreground color 

context [19] = Oxffffffff; 


// set DP_PIX_WIDTH and CHAIN _MASK according to color depth 
switch (modeinfo.bpp) 
{ 
case 4: 
context[20] = 0x00008888; 
context[21] = HOST _4BPP | SRC_4BPP | DST_4BPP; 
break; 
case 16: 
1f (modeinfo.depth == 555) 
{ 


// 555 color weighting 
context[20] = 0x00004210; 
context [{21] = HOST_15BPP | SRC_iI5BPP | DST_15BPP; 


} 


else 


{ 
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// 565 color weighting 


context[20] = 0x00000410; 
context[21] = HOST_16BPP | SRC_16BPP | DST_16BPP; 
} 
break; 
case 32: 
context[20] = 0x00008080; 
context[21] = HOST _32BPP | SRC_32BPP | DST_32BPP; 
break; 
default: 
case 8: 
case 24: 
context [20] = 0x00008080; 
context[21] = HOST_8BPP | SRC_8BPP | DST_8BPP; 
break; 
} 
context[22] = FRGD_MIX S | BKGD_MIX_ S; 
context[23] = FRGD_SRC_FRGD CLR; 
context[24] = 0x00000000; 
context[25] = Oxfffrfftff; 
context[26] = 0x00000000; 
context[27] = DST_LAST_PEL | DST_Y_TOP_TO_ BOTTOM | DST_X_LEFT_TO_RIGHT; 
context [28] = 0x00000000; 
for (1 = 29; 1 «< 64; i++) 


{ 


context [i] = 0; 


Upload contexts 


Setup 2 contexts - one chained to the other. This is done by setting 
up the CONTEXT LOAD CNTL context register (offset 28) for the current 
context load. When a context load is initiated, the CONTEXT LOAD CNTL 
register will be loaded and executed according to its contents. This 
feature allows context chaining. In this example, the first context 
will draw a YELLOW filled rectangle and the second context will draw 


// a LIGHTGREEN diagonal line. Since the second context is to be chained, the 


// the load pointer bits of the CONTEXT_LOAD CNTL register entry of the 
// FIRST context (load pointer = 4) will be set to point to the second 
// context (load pointer = 5). 

// Context load address calculation: 

// 

// Address = total video memory - (context pointer + 1) * 0x100) 

// 

/* 


*/ 
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The main focus here is to upload the context information for later 
retrieval by the engine context load feature. The method used will 
depend on the hardware setup - see CHAIN.C for more details. 
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// load up first context -> load pointer = 4 
context[28] = CONTEXT LOAD AND DO LINE | 5; // do line for next context 
upload_context (contextaddr - ((4 + 1) * 0x100)); 


// load up second context -> load pointer = 5 

temp = (unsigned long) (modeinfo.xres / 4); 

context[3] = (temp << 16) | (modeinfo.yres / 4); // (%, y) for line 
context[18] = get_color_code(LIGHTGREEN) ; // set line color 
context[28] = 0; // halt chain 
upload_context (contextaddr - ((5 + 1) * 0x100)); 


// initiate context chain 

wait_for fifo(2); // wait for 2 fifo entries 
regw(CONTEXT_ MASK, Oxfffrfrfftftf); 

regw(CONTEXT_LOAD CNTL, CONTEXT_LOAD AND DO FILL | 4); 
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Chapter 5 
Advanced Topics IT 


BOOT-TIME INITIALIZATION 


This section describes the registers required to initialize a mach64 after power-up. 
All boot-time initialization is performed in the adapter ROM. 


The scratch registers, SCRATCH_REGO and SCRATCH_REG1, may be used 
at the adapter ROM’s discretion, with the exception of the lower 7 bits of 
SCRATCH_REG1. These bits are used to communicate ROM segment 
location to applications, and must be initialized at boot-time. Typically, 
installed mode information and other flags are stored in the other bits. 


BUS_CNTL is used to configure the mach64 bus interface unit and to control 


_ FIFO error and host error interrupts. At boot-time, all interrupts should be 


disabled, and the bus interface unit must be programmed appropriately for 
the type of host expansion bus. In determining the appropriate initialization 
values, safest values should be used first, and incrementally reduced until 
minimum safe values are discovered. 


MEM_CNTL is used to configure the memory interface unit. Memory size 
must be determined by the adapter ROM, and written appropriately. Initial 
memory boundary information should be stored in the non-volatile storage 
area. All other configuration bits are first determined empirically using the 
methods described for BUS_CNTL, and later hard-coded for particular 
memory configurations. 


GEN_TEST_CNTL is used for accessing an external EEPROM, enabling overscan 
to external DACs, enabling the hardware cursor, resetting the draw engine, 
enabling VRAM block write memory cycles, and chip diagnostic functions. At 
boot-time, overscan and block write must be initialized. The hardware cursor 
must be disabled, and the draw engine must be reset and enabled. 


CONFIG_CNTL is used for initializing the linear aperture and small 
apertures, setting the card ID, and disabling the VGA. The apertures should 
be disabled on power-up, and should only be initialized when an application 
calls the ROM for aperture services. The card ID should be set to zero in 
single-card systems. The VGA disable bit should never be touched. 


CONFIG_CHIP_ID, CONFIG_STAT0, and CONFIG_STAT1 are used to 
determine board configuration for initialization, for ROM query functions, or 
for hardware debugging. 
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Of all the registers listed above, only CONFIG_CHIP_ID, GEN_TEST_CNTL, and 
SCRATCH_REG!1 should be touched by applications. CONFIG_CHIP_ID is used 
to identify a specific class and revision of accelerator, and GEN_TEST_CNTL is used 
to enable the hardware cursor and reset the draw engine. No other bits should be 
touched inGEN_TEST_CNTL. SCRATCH_REG1 is used to determine the ROM 
segment location for calling ROM service routines. 


ACCESSING THE EEPROM 


a 


The EEPROM interface pins are directly accessible from the lower four bits of 
GEN_TEST_CNTL. Note that these pins are multiplexed with display output pins, 
so garbage will appear on the display when reading or writing from the EEPROM. 
The CRTC should be disabled before any EEPROM operations. EEPROM 
operations are enabled with the GEN_EE_ ENABLE bit. 


* Note that EEPROMsS are generally very slow devices, and the 
GEN_EE_CLOCK can only be strobed at the maximum rate specified by the 
device data sheets. Please consult the manufacturer’s EEPROM specifications 
for more information on reading or writing from that device. 


¢ Note that not all configurations of mach64 will use an EEPROM for a non- 
volatile storage device. 


Sample code SAMPLE . DOC 
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DAC PROGRAMMING 


The mach64 supports DACs from many manufacturers. All will be compatible with 
VGA resolutions (640x480, 800x600, 1024x768, at 8 bits per pixel, up to 80MHz pixel 
clock). Modes beyond these require special programming considerations. 


DACs from different manufacturers will access the extended DAC registers in 
different ways. Some require a special sequence of register reads and writes to 
remap the register address space to its extended registers. Others have extra 
address lines accessible through DAC_EXT_SEL@DAC_CNTL to select extended 
DAC registers. Please consult the manufacturer’s DAC specification for more 
information. Below is sample code for programming some common DAC types: 


Sample code SAMPLE .DOC 
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DIAGNOSTIC FEATURES 


Numerous self-diagnostic features have been designed into the mach64. Test modes 
are set using the GEN_TEST_CNTL register and the scratch registers perform 
different functions depending on the set mode. The test modes available are listed 
below: 


Test Mode 0, All Test Features Disabled 


The scratch registers are used as scratch registers. 


Test Mode 1, Memory Read/Write Test 


SCRATCH_REG0 is used as an indirect address register, and SCRATCH_REG1 isa 
data register. It is conceptually convenient to rename these register as TEST_REGO 
and TEST_REG1. Strobing the GEN_TEST_MEM_STROBE@GEN_TEST_CNTL bit 
will read or write 32 bits of data into TEST_REG1 at the DWORD address specified 
in TEST_REGO. If GEN_TEST_MEM_WR@GEN_TEST_CNTLis set, data is written 
to memory from TEST_REG1; otherwise, data at the specified address is loaded into 
TEST_REGI. 


Test Mode 2, Source and Destination Length Test 


The addresses of SCRATCH_REGO and SCRATCH_REG!1 are remapped to point 
to internal counters. These are referred to as TEST_REG2 and TEST_REG3 and are 
read- only. By putting the chip into single step mode 
(GEN_TEST_DST_SS_EN@GEN_TEST_CNTL or 
GEN_TEST_SRC_SS_EN@GEN_TEST_CNTL), and by watching these registers 
while strobing one of the single step bits 
(GEN_TEST_DST_SS_STROBE@GEN_TEST_CNTL or 
GEN_TEST_SRC_SS_STROBE@GEN_TEST_CNTL), the integrity of these counters 
can be assured. 


Test Mode 3, Source FIFO Read Length Counter Test 


Similar to test mode 2, except that SCRATCH_REGO is mapped to the source FIFO 
read length counter and is referred to TEST_REG4. The size of the source FIFO is 
4x64. 
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Test Mode 4, CRTC Test 


The internal CRTC counters may be tested in this mode by single stepping the CRTC 
with GEN_TEST_CC_EN@GEN_TEST_CNTL and 
GEN_TEST_CC_STROBE@GEN_TEST_CNTL. TEST_REG5 and TEST_REG6 are 
used to determine the internal state of the CRTC counters, flags, and comparators. 


Test Mode 5, Display CRC Test 


A CRC check of the display output can be done in this mode. It provides a 
convenient method of checking the DAC interface, overscan, and hardware cursor 
circuitry. Initiate the CRC by writing to the 
GEN_TEST_CRC_STR@GEN_TEST_CNTL bit, and waiting for the 
GEN_TEST_CRC_DONE@GEN_TEST_CNTL bit to go high. TEST_REG7 contains 
the accumulated CRC. 


Other Test Features 


Other test features include: 


- © The GEN_TEST_FIFO_EN@GEN_TEST_CNTL bit, which halts consumption 
of the command FIFO so that all FIFO entries may be guaranteed to be filled 
and tested. 


* The GEN_TEST_GUI_LREGS_EN@GEN_TEST_CNTL bit, which prevents 
draw operations and context loads from initiating. This allows all registers to 
be written and read without causing the draw engine to do unwanted draw 
operations. 


All other registers, data paths, sequencers, and draw functions may be tested using 
standard draw operations, and comparing the results against known data. 
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PERFORMANCE ISSUES 


Performance is a complex issue, which requires a clear definition of the terminology 
and an explanation of the factors affecting graphics performance. 


Redundancy 


Redundancy is the duplication of information. Most draw operations are 
redundant in that the same pixel or pattern of pixels is repeatedly written into 
memory. Since host expansion buses (ISA, EISA, MCA, VLB, PCI) tend to be slow, 
draw operations performed by the host CPU tend to be slow as well. Graphics 
accelerators improve performance by reducing the amount of redundant 
information traveling across the host expansion bus by simply specifying the type 
of pixel information to be written and the draw trajectory. 


Any operation whose draw information cannot easily be reduced (such as a host- 
to-screen bitmap transfer) should do direct memory writes into the linear frame 
buffer instead of being drawn by the draw engine, because draw setup overhead 
will slow the operation. 


Draw Speed 


Draw speed isa raw measure of how fast the draw engine can put pixels to memory, 
measured in pixels per second. Many benchmark programs do not measure draw 
speed correctly because they do not factor in concurrency. 


Concurrency 


Concurrency is the inherent ability of graphics accelerators to perform a draw 
operation at the same time that the host CPU is doing something else. An 
accelerator is a fixed-function processor that performs dedicated tasks and relieves 
the CPU to do other tasks. Concurrency and reduction of redundancy are the 
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PERFORMANCE ISSUES 


primary reasons why graphics accelerators are faster than dumb frame buffer 
devices, such as the VGA. 


Efficiency 


Efficiency is a measure of concurrency. Maximum efficiency in a software process 
is achieved when the host is never idle and the draw engine is never idle (this never 
happens). Efficiency will be affected by draw speed, CPU speed, FIFO depth, and 
order of draw operations (for example, a draw engine operation followed by a linear 
frame buffer access requires a wait-for-engine-idle in between, which causes the 
CPU to idle, thus decreasing efficiency). 


Performance should be measured on both slow CPUs and fast CPUs, because 
efficiency differs radically from system to system. 


Expansion Buses 


There are currently five different expansion bus standards for X86 platforms: 


° ISA 
° EISA 
°° MCA 
° VLB 
° PCI 


Each differs in maximum and typical throughput. Bus type will only affect the 
performance of host-to-screen and screen-to-host transfers. Most other draw 
operations have very low redundancy, and bus transfer times are very small. 


VRAM vs. DRAM 


The CRT controller needs to fetch screen data at pixel clock rates. VRAM is dual- 
ported, and allows the CRTC to read memory while the draw engine is accessing 
the same memory. Some VRAMs also have specialized circuitry to do fast 
monochrome expansions. Accesses to DRAM must be arbitrated between the 
CRTC and draw engine (the CRTC has priority), thus reducing the total number of 
memory cycles available to the draw engine, and slowing draw speed. In DRAM 
configurations, graphics mode has a direct correlation to draw speed, because the 
higher the pixel clock rate, the less available memory bandwidth. 


In summary, VRAMs are faster than DRAMs, but the difference will only be 
perceptible and measurable ifthe target application is engine-bound (i.e.,CPU must 
wait for draw engine = poor efficiency). | 


ie Note that graphics benchmark programs are atypical because they have inherently 
low efficiency. 
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Block write 


PERFORMANCE ISSUES 


Block write is a high-speed color fill feature of VRAMs and some specialized types 
of DRAMs. Four consecutive addresses may be filled with a solid color in the time 
it takes to do a single memory access. 


The mach64 uses block write ifGEN_BLOCK_WR_EN@GEN_TEST_CNTLis enabled, 
the foreground mix is set to paint (function 7), the background mix is set to transparent 
(function 3), the color compare function is set to FALSE, WRT_MASK is set to all 1's, 
destination pixel size is 8, 15, 16, or 32bpp, and DST_24_ROT_EN@DST_CNTL is 
disabled. Any monochrome source may be used. It is the adapter ROM’s 
responsibility to enable GEN_BLOCK_WR_EN@GEN_TEST_CNTL at boot time if a 
compatible type of memory is detected. 


Memory Bandwidth 


Memory bandwidth is a measure of the number of memory accesses per second, 
which is easily quantifiable. On the mach64,a memory access to a current page 
costs two cycles; a page faulted memory access costs seven cycles. 


Page A page is defined as 512 addresses, where the data width may be 32 bits or 64 bits 
wide, depending on memory configuration. The frequency of page faulting 
depends on the burst rates of the various devices contending for the memory bus. 
The source FIFO is 4x64, and the CRTC video FIFO is 16x64. 
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Performance 


Performance is a holistic measure, and depends greatly upon host configuration, 
accelerator configuration, and application efficiency. Performance cannot be 
quantified in a single measure. 


¢ System performance can be improved with faster host and accelerator 
configurations. 


¢ Application performance on a fixed hardware configuration can be improved 
by reducing redundancy and improving efficiency on a particular target 


system. 
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BIOS Services 


INTRODUCTION 


All accelerator services are provided in the accelerator ROM segment at offset 64h. 
The ROM segment is determined by reading the SCRATCH_REGI1 register and 
calculating: 


SEGMENT = (SCRATCH_REG1 & 0x7F) « 0x80 + 0xC000 
where SCRATCH_REG1 is 046ECh 


Function codes are placed in AL. Functions are called by first doing a PUSHF, and 
then far calling to SEGMENT:64h. 


All functions return with error code in AH 


Function 0, load coprocessor CRTC parameters 
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Function 0, load coprocessor CRTC parameter: 


PRG888GX0-01 
A-2 


Function 0, load coprocessor CRTC parameters (cont’d) 


Function _1, set display mode 


Function 2, load coprocessor CRTC parameters and set display mode 


Function 3, read EEPROM data 


Function 4, write EEPROM data 
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Function 6, short query function 


Function 6, short query function 


Function 7, return hardware capability List 


In return DX:BX = offset into a table specifying the maximum dot clock information; 
the table is terminated by a zero in the first column (see below). 


H_DISP = Horizontal resolution, in number of 
characters. 
DAC_MASK = (1 shl dactype) 
Memory = Minimum memory required to support 
Requirements the specified resolution and color depth 
(DRAM requirement shl4) or (VRAM 
requirement) 
Maximum = Maximum dot clock with the specified 
Dot Clock resolution and color depth, in MHz 
Pixel Width = Color depth 
© 1993 ATI Technologies Inc. PRG888GX0-01 
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Function 8, return query device data structure in bytes 


Function 8, return query device data structure in bytes 


Function 9, query device 


Function OAh, return clock chip frequency table 


Function OBh, program a specified clock entry 


Function 0Ch, DPMS service, set DPMS mode 
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Function ODh, return current DPMS state in CL 


Function ODh, return current DPMS state in CL 


Function OEh, set graphics controller’s power management state 


Function OFh, return 
current graphics controller’s current power management state 


Function 10h, set RAMDAC to different states 


Function 11h, return external storage device information 
(INSTALL should use this information to dynamically configure the data structure) 
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Function 12, Short Query 


Function 12, Short Query 


Nu dda of mode tables 


Size of each mode table in bytes 
ASIC identification . 


VGA Type: 
0 =disabled 
1 =enabled 


VGA Boundary: 
= full access 
= 256K 
= 512K 
= 768K 
=1M 
Oh= no access through VGA 
emory Size: 
= 512K 
=1M 
=2M 
=4M 
= 6M 
= 8M 
=12M 
= 8M 
its 3-0, DAC Type: 
= Reserved 
= Reserved 
= TI 34075/ATI68875 
= Brooktree BT476/8 
= Brooktree BT481, AT&T20C490/491, $C15025/15026, 
IMS-G174, MU9C4910, MU9C1880 
= ATI68860 
= $TG1700 
= §C15021 
its 7-4 = Reserved 


0 
1 
2 
3 
4 
1 


<= 


RWN OD NOM PR WH - © 


DWN AY 
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Query Structure 


Memory Type: 
0 = DRAM 256Kx16 

1 = VRAM 256Kx4 

2 = VRAM 256Kx16 

3 = DRAM 256Kx4 

5 = VRAM 256xé4 special 


fe 6 = VRAM 256x16 special 
Bus Type: 
O0=ISA 
1=EISA 
2 = Reserved 
3 = Reserved 
4 = Reserved 
5 = Reserved 
6 = VLB 
7 =PCl 
| 10h:11h 


ODh 

OEh 

OFh Bit 7 - Enable composite sync 
Bit 6 - Enable sync on green 

13h 

14h 


10h:11h | Aperture address in megabytes (0-4095) 
Aperture configuration (see extended BIOS function AL=6) 
5=1 ; support 256 grey scale 


4=1  ; support sleep mode 


16h:17h_ | Offset into current mode table if non-zero (not implemented) 


I/O base address 


= Mode tables immediately follow the device status table. Use the forward pointer 
to reference mode tables, as the device status table may expand in the future. It is 
possible to have no modes installed. Typically, between two and seven mode tables 
will be returned. 


Color Depth Support 
Bit Definition 

P= ; Reserved 

6= ; Reserved 

= : Reserved 
; support 32bpp (unpack 24 bpp) 
; support BGR in 24bpp 
; support RGB in 24bpp 
; support 16 bpp, 555 
; support 16 bpp, 565 
RAMDAC support feature 
Bit Definition 

7=1 ; support sync On green 

6=1 ; support gamma correction 


OnmrnNWA 
hou wm wou 
on oe ee er 
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Mode Table Structure 


Installed Mode Table 1 
0:1 Horizontal display resolution, in pixels 
2:3 Vertical display resolution, in scanlines 


Maximum pixel depth (see function 0, CL[3-0], on page A-1 for 
interpretation) 


5 Reserved 


Offset into EEPROM 
=0  ;Table is generated from VGA parameters 
<>0__ ;offset into EEPROM table 


DAhOBh 


Bits 15:11= Reserved 
Bit 10 = Enable MUX mode 
Bit 9 = Enable interlace 

Bit 8 = Enable double scan 
Bits 7:0 = Reserved 


OEh CRTC_H_TOTAL 
Fh as 


N 


OCh:0Dh 


10h 
Th 
12h gh 


Bits 15 -12 = CRTC_H_TOTAL_DLYBit 3 - 2 = OVR_WID_LEFT 
Bits 11 - 8 = CRTC_H_SYNC_DLY 

Bits 7 - 4 = OVR_WID_RIGHT 

Bits 3 -0 = OVR_WID_LEFT 


1Eh:1Fh OVR_WID_TOP, OVR_WID_BOTTOM 
20h:21h OVR_CLR_B, OVR_CLR_8 
22h:23h OVR_CLR_G, OVR_CLR_R 


Installed Mode Table 2 


24h:47h Entries definition same as mode table 1. 


1Ch:1Dh 


Installed Mode Table n 


N*24h- Entries definition same as mode table 1 
(N*24423h) 
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EEPROM Map 


EEPROM Write Counter ii 
Reserved 


7 


on) 


; EEPROM checksum, modular 8 of 8-bit data; the sum of all the entries 
° in the EEPROM must be 0 


—_ 


5 


Reserved — no application program should change the factory default, 


zero 


Reserved 


EEPROM table revision 


Custom monitor indices 


15 
1280x1024 refresh rate information 
Sh 


—_ 


6 [a tiknable oats 72H 
0 [erable composite yng 
Te [Reseed 
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CX EEPROM DATA STRUCTURE 


Reserved 


: = 1; Select 1024x768 in 72Hz 
= 1; Select 1024x768 in 70Hz 
= 1; Select 1024x768 in 60Hz 


oe = 1; Select 1024x768 in 87Hz interlaced 


Power-Up Video Mode: 
3 = VGA color - secondary 
15:8 | 5 = VGA monochrome - secondary 
9 = VGA color - primary 
B = VGA monochrome - primary 


Monochrome Mode Color Select: 
0 = White 
1 = Green 


5 Dual monitor enable 


Font Selection at power-up: 
0 = 8x14 or 9x14 
1 = 8x16 or 9x16 


Zero Wait State Ram 
0 = Disabled 
1 = Enabled 


Zero Wait State ROM 
0 = Disabled 
1 = Enabled 


16 BitROM 
0 = Disabled 
1 = Enabled 
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CX EEPROM DATA STRUCTURE 


Host data transfer width: 

0 = Auto select 
1 = 16-bit 

2 = 8-bit 

3 = 8-bit host, 16-bit others 


15:14 
rr 


VGA boundary: 
0 = No boundary 
1 =512KB 

2 = 1MB 


Monitor Alias enable 


Aperture Location (in MByte) 
1 


5:4 


Mouse address: 
00h = Mouse disabled 

O8h = Secondary address selected 
18h = Primary address selected 


Interrupt level: 
20h = IRQ5 
28h = IRQ4 
30h = IRQ 3 
38h = IRQ 2 


Aperture Size (BIOS will not use this value if Aperture Location is a 
3:0 | non-zero value — aperture size will be based on amount of video 
memory installed) 
i RSE Sa 
Po CRT parameter table 1 
26h:34h is ek RT parameter table 2 
35h:43h ae tl CRT parameter table 3 | 
44h:52h cal CRT parameter table 4 


h 
h 


53h:61h CRT parameter table 5 
62h:70h CRT parameter table 6 


71h:7Fh CRT parameter table 7 
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CX CRT PARAMETER TABLE 


Video mode select 1 
Video mode select 2 

Video mode select 3 
“ee 


CRT usage (VGA only) 
O = Use sync polarities only 
1 = Use all CRT parameters 


4 
Reserved 
| 15:8 | 


gp |_15:8_| MAX_SCAN_LINE (CRTO9) 
H_TOTAL (CRTOO) 
a H_RETRACE_END (CRTOS) 
V_RETRACE_STRT (CRT10) 
H_BLANK_STRT (CRT02) 
. 


CLOCK_CNTL 


V_BLANK_STRT (CRT15) CRTC_V_SYNC_STRT (7:0) 
lf == Offh or == programmable 
h entry in clock chip, use Dot Clock 
in entry 9 and programmable entry 
V_TOTAL (CRT06) CRTC_V_SYNC_WIDTH 
51 \V_DISP_END (CRT12) Dot clock (15:8) 
7:0 | CRT_MODE (CRT17) Dot Clock (7:0) 
PRG888GXO0-01 © 1993 ATI Technologies Inc. 


CRTC_OVERFLOW (CRTO07) 

15:8 
in dot clock. 
bits 15:12 = Reserved 

mes bits 11:8 = CRTC_H_SYNC_DLY 

; bits 7:4 = OVR_WID_RIGHT 
bits 3:0 = OVR_WID_LEFT 
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CX CRT PARAMETER TABLE 
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SCRATCH REGISTERS 


PRG888&GX0-01 
B-6 


SCRATCH_REGO + 1 (42EDh) | 800x600 refresh rate information 
SCRATCH_REGO + 3 (42EFh) 1024x768 refresh rate information 
SCRATCH_REG1 (46ECh) ROM location 


fe 
[SCRATCH REGI +2 @eEER) | OOCSC~—“SCSCSCS 
ee 


640x480 refresh rate information 


monochrome mode; color information 
Set to VGA display if intO is called 


1CE/BB 


SCRATCH_REG1 + 3 (46EFh) Programmable dot clock information 
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CR T Parameters 


640x480 60Hz NON-INTERLACED 


0x20C 
0x1 DF 
Ox1EA 
Ox22 


Screen Total 0x63 V_TOTAL 

Screen Display V_DISP 

Syne Start V_SYNC_STRI 

Sync Width V_SYNC_WID 
Resolution 480 
Scan Frequency 59.94Hz 
Polarity 
Sync Width 0.064ms 
Back Porch 1.017ms 
Active Time 15.253ms_ 
Blank Time 1.430ms 


1 
— 
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640x480 60Hz NON-INTERLACED 


640x480 60Hz NON-INTERLACED 


Screen Total V_TOTAL__ 
Screen Display V_DISP 
Sync Start V_SYNC_STRT 
Sync Width H_SYNC_WID V_SYNC_WID 
Resolution 480 

Scan Frequency 59.94Hz 

Polarity E 
Sync Width 0.064ms 
Front Porch 0.350 ms 
Back Porch 1.017ms 
Active Time 15.253ms 
Blank Time 1.430ms 


640x480 72Hz NON-INTERLACED/32 


0x207 
Ox1 DF 
Ox1E8 
0x23 


Screen Total V_TOTAL 

Screen Display V_DISP 

Sync Start V_SYNC_STRT 

Sync Width V_SYNC_WID 

Resolution 480 
(-) ; 


Scan Frequency 71.89Hz 
Polarity ae Cea ee 
Syne Width 0.080ms 

| 28lines 


! 
— 


Back Porch __ 4.750us 0.749ms 28lines 
Active Time 20.000us 12.840ms 480lines 
Blank Time 6.750us 1.070ms 
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640x480 72Hz NON-INTERLACED/40 


640x480 72Hz NON-INTERLACED/40 


[Front Porch | 1.600us_[  @chars | 1.120ms_[ 50 lines | 
[Back Porch _—+4|—_1.600us_—*| chars 


800x600 89Hz INTERLACED 


H_TOTAL V_TOTAL Ox2BC 
Screen Display H_DISP V_DISP 0x257 
H_SYNC_STRT V_SYNC_STRT 0x262 


Sync Width H_SYNC_WID V_SYNC_WID Ox2C 


Resolution ae eee ee ee, 
[Sync Width | 0.985us_ | chars | .191ms_ |Z ines | 
|BlankTime | 7-138us_ | 29 chars | 1.604ms_ | 101 lines _| 
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800x600 95Hz INTERLACED 


800x600 95Hz INTERLACED 


Screen Total V_TOTAL 0x2BC 

Screen Display V_DISP 0x257 

Sync Start | H_SYNC_STRT —s| Ox6D.———s|: V_SYNCC_STRT 0x262 

Sync Width V_SYNC_WID OxC 

Resolution 
(-) 


Scan Frequency 96.53Hz 
Polarity Se 
Sync Width 0.177ms 
Front Porch 0.163ms 


— 


Back Porch 1.556us 1.153ms 
Active Time 22.222us 100 chars 8.867ms 600 lines 
7.333us 


Blank Time 


1.493ms 101 lines 


800x600 56Hz NON-INTERLACED 


fein ee, 


PRG888GX0-01 © 1993 ATI Technologies Inc. 
C-4 Proprietary and Confidential 


800x600 60Hz NON-INTERLACED 


800x600 60Hz NON-INTERLACED 


H_TOTAL V_TOTAL 0x273 
Screen Display H_DISP V_DISP 


H_SYNC_WID 
|Resolution | OD 
) 
[Sync Width | 3.207us | te chars | 0-106ms_ | ines 


800x600 70Hz NON-INTERLACED 


V-TOTAL 03278 
V_SYNCSTRT | 0260 
VSYNCWID | oc 
70.04Hz 

0.26978 
0.202ms | Sines 
0.337ms 
13.470me 
0808s 


-_ 
1 


—* 
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800x600 72Hz NON-INTERLACED 


800x600 72Hz NON-INTERLACED 


cote ee 
Polarity 


800x600 76Hz NON-INTERLACED 


Screen Total V_TOTAL 

Screen Display V_DISP 

Sync Start V_SYNC_STRT 

Sync Width V_SYNC_WID 

Resolution 
(-) 


Ox2B1 
0x257 
Ox27E 
0x24 


Scan Frequency 76.01Hz 
Polarity i eae pne.” See 
Syne Width 0.076ms 
Front Porch 0.744ms 
Back Porch 0.896ms 
Active Time 11.441 ms 
Blank Time 1.7 16ms 


1 
~~ 


4 lines 
39 lines 
47 lines 

600 lines 
90 lines 
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1024x768 87Hz INTERLACED 


1024x768 87Hz INTERLACED 


Screen Total 


H_TOTAL }0x9D «| V_TOTAL 0x330 
Screen Display H_DISP V_DISP Ox2FF 
Sync Start H_SYNC_STRT V_SYNC_STRT 


Sync Width H_SYNC_WID V_SYNC_WID Ox8 


Resolution 768 

Scan Frequency 86.96Hz 
Polarity 
Sync Width 0.113ms__ |B lines__| 
Front Porch 0.014ms 


= 


Back Porch 1.069us 0.563ms 
Active Time 22.806us 128 chars 10.810ms 768 lines 
Blank Time 5.345us 0.690ms 


1024x768 56Hz NON-INTERLACED 


Ox9C V_TOTAL 
Screen Display V_DISP 
Syne Stat V_SYNC_SIRT 
Sync Width V_SYNC_WID Ox9 
Resolution 76 
Scan Frequency | 56.09Hz 
Polarity ) 
Syne Width oz00ms |S Tines 


Screen Total 


Back Porch 0.288m: 
pative The 17.031ms 
Blank Time 0.796ns 
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| 1024x768 60Hz NON-INTERLACED 


1024x768 60Hz NON-INTERLACED 


Screen Total 


H_TOTAL V_TOTAL 
Screen Display H_DISP V_DISP Ox2FF 
Sync Start H_SYNC_STRT V_SYNC_STRT 


Sync Width H_SYNC_WID V_SYNC_WID Ox4 


Resolution 768 

Scan Frequency 60.45Hz 
Polarity ) 

Sync Width 0.083ms 
Front Porch 0.021ms 
Back Porch 0.558ms 
Active Time 15.880ms 
Blank Time 0.662ms 


1024x768 66Hz NON-INTERLACED/75 


Screen Total H_TOTAL V_TOTAL 
Screen Display H_DISP V_DISP 
Sync Start H_SYNC_STRT V_SYNC_STRT 0x307 


Sync Width H_SYNC_WID V_SYNC_WID Ox4 


Resolution 768 
Scan Frequency 
Polarity | 
Syne Width 0074s 
Front Porch 0.146ms | 6 lines 
Back Porch 0.668ms 
Active Time 13.653us 14.254ms 
Blank Time 0.891ms 


' 
— 
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1024X768 66Hz NON-INTERLACED/72 


1024X768 66Hz NON-INTERLACED/72 


Screen Total 


V_TOTAL 
Screen Display V_DISP 
Sync Start V_SYNC_STRT 
Sync Width H_SYNC_WID V_SYNC_WID Ox4 

Resolution 76 
Scan Frequency 66.13Hz 
Polarity W) 
Sync Width 0.074ms 
Front Porch 0.148ms 
Back Porch 0.667ms 
Active Time 14.232ms 
Blank Time 0.889ms 


4 lines 


36 lines 
768 lines 
48 lines 


1024X768 70Hz NON-INTERLACED 


Screen Total 
HLSYNC_WID Ox8 
Senrredieny 
Polarity 
768 line 
Blank Time 36 lines 
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1024x768 72Hz NON-INTERLACED 


1024x768 72Hz NON-INTERLACED 


H_TOTAL V_TOTAL 0x325 


VTOTAL 
H_SYNC_WID 
| Q 


1024x768 76Hz NON-INTERLACED 


V_TOTAL 
V_DISP 
V_SYNC_STRT 
V_SYNC_WID Ox4 
768 
76.02Hz 
) +) 
0.065ms_ 
0.130ms 
0.440ms 
12.518ms 
0.636ms 


4 lines 

8 lines 
27 lines 
768 lines 
39 lines 
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1120x750 87Hz INTERLACED 


1120x750 87Hz INTERLACED 


H_TOTAL V_TOTAL 0x330 


1120X750 60Hz NON-INTERLACED 


Screen Total H_TOTAL OxAB V_TOTAL 
Screen Display 0x2ED 


V_DISP 
Sync Start V_SYNC_STRT 
Sync Width V_SYNC_WID Ox4 
Resolution 750 

Scan Frequency 
Polarity 4) 
Syne Width 0.085 
Front Porch 0.127ms | __é lines 


Back Porch 1.723us 0.572ms 
Active Time 17.23 1us 140 chars 15.877ms 750 lines 
Blank Time 3.938us 0.783ms 
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1120X750 70Hz NON-INTERLACED 


1120X750 70Hz NON-INTERLACED 


a @ 
Front Porch | 1.00us__| 10chars | _0.263ms | 15 lines 


1280x1024 87Hz INTERLACED 


Screen Total OxC7 
Ox9F 
OxA9 V_SYNC_STRT 
OxA V_SYNC_WID 
1024 
87.03Hz 
(+) 
0.1000 
0.500ms 
0.650ms 
| 10.240ms 
Blank Time 1.250ms 


V_TOTAL 
V_DISP 
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1280x1024 95Hz INTERLACED 


1280x1024 95Hz INTERLACED 


Screen Total V_TOTAL 
Screen Display Ox9F V_DISP 

Sync Start V_SYNC_STRT _ 
Sync Width H_SYNC_WID V_SYNC_WID 
Resolution 

Scan Frequency 

Polarity 


Sync Width 10 chars 0.100ms 10 lines 


Front Porch {| 1.000us_ | 10 chars_ | _0.010ms 


Back Porch 2.000us 0.180ms 
Active Time 16.000us 160 chars 10.240ms 1024 lines 
Blank Time 4.000us 0.290ms 


1280x1024 60Hz NON-INTERLACED 


OxD6 
Screen Display Ox9F 
Sync Start OxA9 V_SYNC_STRT 
Sync Width OxE V_SYNC_WID Ox5 
Scan Frequency 
Polarity (¥) 
Sync Width 14 chars _0.078ms 


V_TOTAL 
V_DISP 


Screen Total _ 


Front Porch  0.016ms 
Back Porch 0.579ms 
Active Time 16.012ms__ 
Blank Time 0.672ms 
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1280x1024 70Hz NON-INTERLACED 


1280x1024 70Hz NON-INTERLACED 


H_TOTAL V_TOTAL 0x429 
Screen Display H_DISP V_DISP Ox3FF 


0 


- 
— 


) 
0.067ms 
0.013ms 
0.482ms 
13.718ms 
0.563ms 


1280x1024 74Hz NON-INTERLACED 


V_TOTAL 

V_DISP 
V_SYNC_STRI 
V_SYNC_WID 
7aciiHz 
038005 
0.000m 
0.127 
72.986m: 
0.507 


0x427 
Ox3FF 
Ox3FF 
Ox1E 


30 lines 

O lines 

10 lines 
1024 lines 
40 lines 
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Register Summary 


The mach64 ASIC has the following five register classes: 
° VGA registers 
¢ Setup and control registers 
¢ Accelerator CRTC and DAC registers 
* Draw engine context control registers 


* Draw engine trajectory control registers. 


VGA REGISTERS 


VGA registers are completely segregated from the accelerator registers. Their 
functions are mutually exclusive. They are addressed at I/O ports 1CE-1CFh, 
3B0-3BFh, 3C0-3CFh, 3D0-3DFh. See the mach64 Accelerator VGA Registers Guide for 
more details. 
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SETUP AND CONTROL REGISTERS 


SETUP AND CONTROL REGISTERS 


Setup and control registers are memory-mapped, and are also aliased at I/O base 
address 2EC-2EFh. Most of these registers are initialized once only, at boot time. 


Es 


42ECh 
SCRATCH REGI 46ECh 
BUS_CNTL ech [28h —*4| kw 
3 
3 


Es 


2|2 


ISCRATCH.REGOD 
rsuscNT i 
TMEM.CNTE i 
MEM.VGA_WP_SEL 56Ech | 20h_——+| RW 
MEM.VGA_RP_SEL sazch [2th | RW 
[oe TESTCNTL i 
[CONFIG_CHIP_ID ch 
[CONFIG-STAT. oh 


2|2 


0 
8 
MEM_CNTL s2ech_ | Ch S| RW 
2D 
4 


GEN_TEST_CNTL [eeech [aah dR 
CONFIG_CNTL [—eaéch | dR 

[tech [eh 
[—yaech [3h «i oR 


2|2 
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ACCELERATOR CRTC AND DAC REGISTERS 


ACCELERATOR CRTC AND DAC REGISTERS 


The CRTC and DAC registers are memory-mapped, and are also aliased at I/O base 
address 2EC-2EFh. The accelerator CRTC registers are separate from the VGA 
CRITIC registers. 


| RW 
| RW 
| RW 
| RW | 


26ECh 


CUR_HORZ_VERT_OFFSET 


DAC_RECS SEECh 
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DRAW ENGINE CONTEXT CONTROL REGISTERS 


DRAW ENGINE CONTEXT CONTROL REGISTERS 


Draw engine context control registers are memory-mapped. They are used to set 
up the draw engine data path and destination mixing logic. 


Pattern register 0, for fixed patterns 


PAT_REGO R/W 
PAT_REG1 Ath Pattern register 1, for fixed patterns 


; 


ALU carry chain mask (only used for (D+S)/2) 


Pixel width, for setting source, destination, 
and host pixel widths 


Mix, for setting foreground and background 
mixes 


Aah 
Aah |W 
ACh 
W 

Bzh 


DP_PIX_WIDTH B 
DP_MIX B 
DP_SRC 


CLR_CMP_CLR 
CLR_CMP_MSK 


B 


B 0 
DP_WRITE_MASK 2 
DP_CHAIN_MASK B3 

a 4 
2 5 


h 


Source, for setting mono source, and 
foreground and background sources 


Compare color 
Compare mask 


w 
on) 
= 


| 2 (Be 2 [2 | 2 (22/22 <2}2\<|2I2)5182 


a) 
io) 
a 


CLR_CMP_CNTL Coh Compare control, for setting compare 
function and compare source 
FIFOLSTAT Cah 


. 
Z 


CONTEXT_MASK CB RW joes load mask, for selectively loading 
raw engine registers 


Context save pointer, in units of 64 
DWORDs. Writing to this register saves the 
engine context 
Context load pointer, for restoring an engine 
context with an option to perform a draw 
operation 
Trajectory control, for setting DST_CNTL, 
SRC_CNTL, HOST_CNTL, and PAT_CNTL 
with a single memory access 


Engine status 


@) 
> 
= 


rie | 2] 2 


CONTEXT_SAVE_PTR 


CONTEXT_LOAD_PTR 


GUI_TRAJ_CNTL CCh 


GUI_STAT CEh 
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DRAW ENGINE TRAJECTORY CONTROL REGISTERS 


DRAW ENGINE TRAJECTORY CONTROL REGISTERS 


Draw engine trajectory control registers are memory-mapped. They are used to set 
up the source and destination trajectories and to initiate draw operations. 


Destination offset and pitch 
Destination X 


DST_Y 42h Destination Y 


DST_Y_X 43h Destination Y and X 
es | Destination width 
DST_HEIGHT Destination height 
Destination height and width. 
Destination X and width 
es | Bresenham line length 


resenham error term 
Bresenham axial step term 


Bresenham diagonal step term 


Destination control, for setting 
destination trajectory direction, 
destination side effects, line and 
polygon options, and packed 24 bit 
initial rotation value 


Source offset and pitch 


Source X 
Source Y 
Source Y and source X 


Source width1 — for setting the 
source width for unbounded Y and 
general pattern sources, for setting 
the minor source width for general 
patterns with rotation 


Source height! — for setting the 
source height for general pattern 
sources, for setting the minor 
source height for general patterns 
with rotation 


Source height! and source width1 


Source X start — for setting the 
SRC_X_START starting X location for general 
patterns with rotation 
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Source Y start — for setting the 
starting Y location for general 


68h R/W 
patterns with rotation 


SRC_Y_X_START | 69h | W |__| Source Y start and source X start 
Source width2 — for setting the 
SRC_WIDTH2 6Ah R/W major source width for general 
patterns with rotation 
SRC_HEIGHT2 6Bh R/W major source height for general 
patterns with rotation 
SRC_HEIGHT2_WIDTH2 | 6Ch_| W |__| Source height 2 and source width2 
Source control — for setting source 
SRC_CNTL trajectory type and source trajectory 
modifiers 
80:8Fh | W but mapped to 16 separate 
locations 
HOST_CNTL 90h RW Host control — for setting host 
consumption modifiers 


SRC_Y_START 


Source height2 — for setting the 


Host registers 0 to 15 are identical 


HOST_DATA[0-15] 


MEMORY MAPPING 


PRG888GX0-01 


All memory-mapped registers reside at offset 3FFCOOh from the base aperture 
address on 4MB boards or 7FFCO0h on 8MB boards. If VGA is enabled, these 
registers are aliased at offset FCO0h from B0000h. 
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Sample Code 


mach64 SAMPLE CODE 


Microsoft® C +All C source sample code is compatible with Microsoft® C compilers (version 5.1 


ryan sand and newer). All assembler source sample code is compatible with Microsoft® 
compilers MASM compiler version 5.1. 


ATIM64.INC 


ATIM64.INC 


ASM include file that contains ATI MACH64 register definitions 


* 
rd 
e 
a 
° 
a 
e 
ro 
a 
° 
a 


Copyright (c) 1993 ATI Technologies Inc. All rights reserved 


3 SS SSS SS SS SS SSS SS SS SSS SSS SS SS SSS SS SS SS SS SS SS SS SS SS SSS SSS SSS SS SS SS SSS SSS SSS SES 


3 NON-GUI IO MAPPED Registers 


LoCRTC_H TOTAL DISP equ 02ECh 
LoCRTC_H SYNC_STRT_WID equ 0O6ECh 
LoCRTC_V_TOTAL DISP equ OAECh 


1oCRTC_V_SYNC_STRT_WID equ OEECh 
ioCRTC_VLINE_CRNT_VLINE equ 12ECh 


LoCRTC_OFF_PITCH equ 16ECh 
LoCRTC_INT_CNTL equ l1AECh 
LoCRTC_GEN_CNTL equ i1EECh 
LoOVR_CLR equ 22ECh 


LoOVR_WID_LEFT_RIGHT equ 26ECh 
LcOVR_WID_TOP_BOTTOM equ 2AECh 


ioCcUR_CLRO equ 2EECh 
iLocUR_CLR1 equ 32ECh 
ioCUR_OFFSET equ 36ECh 
© 1993 ATI Technologies Inc. . | PRG888GXO-01 
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ATIM64.INC 


ioCUR_HORZ_VERT_POSN 


LoCUR_HORZ_VERT_OFF 


LoSCRATCH_REGO 
LoSCRATCH_REG1 


LoCLOCK_SEL_CNTL 
LoBUS_CNTL 
1LoMEM_CNTL 
LoMEM_VGA_WP_SEL 


1oMEM_VGA_RP_SEL 


1ODAC_REGS 
LoDAC_CNTL 


LoGEN_TEST_CNTL 


LoCONFIG_CNTL 
LoCONFIG_CHIP_ID 
LoCONFIG_STATO 
LoCONFIG_STAT1 


7; NON-GUI MEMORY MAPPED 


CRTC_H_ TOTAL _DISP 


CRTC_H_ SYNC_STRT_WID 


CRTC_V_TOTAL_DISP 


CRTC_V_SYNC_STRT_WID 
CRTC_VLINE_CRNT_VLINE 


CRTC_OFF_PITCH 
CRTC_INT_CNTL 
CRTC_GEN_CNTL 


OVR_CLR 
OVR_WID_LEFT_RIGHT 
OVR_WID_TOP_BOTTOM 


CUR_CLRO 

CUR_CLR1 
CUR_OFFSET 
CUR_HORZ_VERT_POSN 
CUR_HORZ_VERT_OFF 


SCRATCH_REGO 
SCRATCH _REG1 


CLOCK_SEL_CNTL 


BUS_CNTL 
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equ 3AECh 
equ 3EECh 
equ 42ECh 
equ 46ECh 
equ 4AECh 
equ 4EECh 
equ 52ECh 
equ 56ECh 
equ 5AECh 
equ 5EECh 
equ 62ECh 
equ 66ECh 
equ 6AECh 
equ 6EECh 
equ 72ECh 
equ 76ECh 
Registers - 
equ 0000h 
equ 0004h 
equ 0008h 
equ 000Ch 
equ 0010h 
equ 0014h 
equ 0018h 
equ 001Ch 
equ 0040h 
equ 0044h 
equ 0048h 
equ 0060h 
equ 0064h 
equ 0068h 
equ 006Ch 
equ 0070h 
equ 0080h 
equ 0084h 
equ 0090h 
equ OOAOh 


expressed in BYTE offsets */ 


// 


‘Tf 


Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 


Dword 
Dword 
Dword 


Dword 
Dword 
Dword 
Dword 
Dword 


Dword 
Dword 


Dword 


Dword 


offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 


offset 
offset 
offset 


offset 
offset 
offset 
offset 
offset 


offset 
offset 


offset 


offset 


00h 
Olh 
02h 
03h 
04h 
05h 
06h 
07h 


10h 
lih 
12h 


18h 
19h 
1Ah 
1Bh 
1ch 


20h 
2ih 


24h 


28h 
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MEM_CNTL 


MEM_VGA_WP_SEL 
MEM _VGA_RP_SEL 


DAC_REGS 
DAC_CNTL 


GEN_TEST_CNTL 
CONFIG_CHIP_ID 


CONFIG_STATO 
CONFIG_STAT1 


7 GUI MEMORY MAPPED Registers 


DST_OFF_PITCH 
DST_x 

DsT_Y 

DST_Y_X 
DST_WIDTH 
DST_HEIGHT 
DST_HEIGHT_WIDTH 
DST_X_WIDTH 
DST_BRES_LNTH 
DST_BRES_ERR 
DST_BRES_INC 
DST_BRES_DEC 
DST_CNTL 


SRC_OFF_PITCH 
SRC_X 

SRC_Y 

SRC_Y_X 

SRC_WIDTH1 
SRC_HEIGHT1 
SRC_HEIGHT1_WIDTH1 
SRC_X_START 
SRC_Y_START 
SRC_Y_X_START 
SRC_WIDTH2 
SRC_HEIGHT2 
SRC_HEIGHT2_WIDTH2 
SRC_CNTL 


HOST_DATAO 
HOST_DATA1 
HOST _DATA2 
HOST_DATA3 
HOST_DATA4 
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equ 


equ 
equ 


equ 
equ 


equ 


equ 
equ 
equ 


equ 
equ 
equ 
equ 
equ 
equ 
equ 
equ 
equ 
equ 
equ 
equ 
equ 


equ 
equ 


equ 


equ 
equ 
equ 
equ 
equ 
equ 
equ 
equ 
equ 
equ 
equ 


equ 
equ 
equ 
equ 
equ 


00OBOh 


OOB4h 
OOBS8h 


00CcOh 
00c4h 


o00D0h 


OOEOh 
OOE4h 
OOES8h 


0100h 
0104h 
0108h 
010Ch 
0110h 
0114h 
0118h 
011Ch 
0120h 
0124h 
0128h 
012Ch 
0130h 


0180h 
0184h 
0188h 
018Ch 
0190h 
0194h 
0198h 
019Ch 
01A0h 
O1AGh 
O1A8h 
01ACh 
01B0h 
01B4h 


0200h 
0204h 
0208h 
020Ch 
0210h 


Dword 


Dword 
Dword 


Dword 
Dword 


Dword 


Dword 
Dword 
Dword 


Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 


Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 


Dword 
Dword 
Dword 
Dword 
Dword 


offset 


offset 
offset 


offset 
offset 


offset 


offset 
offset 
offset 


offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 


offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 


offset 
offset 
offset 
offset 
offset 


2Ch 


2Dh 
2Eh 


30h 
31h 


34h 


38h 
39h 
3Ah 


40h 
41h 
42h 
43h 
44h 
45h 
46h 
47h 
48h 
49h 
4Ah 
4Bh 
4ch 


60h 
61h 
62h 
63h 
64h 
65h 
66h 
67h 
68h 
69h 
6Ah 
6Bh 
6Ch 
6Dh 


80h 
8ih 
82h 
83h 
84h 


ATIM64.INC 
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ATIM64.INC 


HOST _DATAS5 
HOST _DATAG6 
HOST _DATA7 
HOST _DATAS 
HOST _DATA9 
HOST _DATAA 
HOST _DATAB 
HOST _DATAC 
HOST _DATAD 
HOST _DATAE 
HOST_DATAF 
HOST_CNTL 


PAT_REGO 
PAT_REG1 
PAT_CNTL 


SC_LEFT 
SC_RIGHT 
SC_LEFT_RIGHT 
Sc_TOP 
SC_BOTTOM 
SC_TOP_BOTTOM 


DP_BKGD_CLR 
DP_FRGD_CLR 
DP_WRITE_MASK 
DP_CHAIN_MASK 
DP_PIX_WIDTH 
DP_MIX 

DP_SRC 


CLR_CMP_CLR 
CLR_CMP_MASK 
CLR_CMP_CNTL 


FIFO STAT 


CONTEXT_MASK 


CONTEXT _LOAD_CNTL 


GUI_TRAJ_CNTL 
GUI_STAT 


PRG888GXO0-01 
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equ 
equ 
equ 
equ 
equ 
equ 
equ 
equ 
equ 
equ 
equ 
equ 


equ 
equ 
equ 


equ 
equ 
equ 
equ 
equ 
equ 


equ 
equ 
equ 
equ 
equ 
equ 
equ 


equ 
equ 
equ 


equ 


equ 
equ 


equ 
equ 


0214h 


0216h 
021Ch 
0220h 
0224h 
0228h 
022Ch 
0230h 
0234h 
0238h 
023Ch 
0240h 


0280h 
0284h 
0288h 


O2A0h 
O2A4h 
O2A8h 
O2ACh 
O02B0h 
02B4h 


02Cc0h 
02C4h 
02C8h 
02CCh 
02D0h 
02D4h 
02D8h 


0300h 
0304h 
0308h 
0310h 


0320h 
032Ch 


0330h 
0338h 


// 
// 


Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 


Dword 
Dword 
Dword 


Dword 
Dword 
Dword 
Dword 
Dword 
Dword 


Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 


Dword 
Dword 
Dword 


Dword 


Dword 
Dword 


Dword 
Dword 


offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 


offset 
offset 
offset 


offset 
offset 
offset 
offset 
offset 
offset 


offset 
offset 
offset 
offset 
offset 
offset 
offset 


offset 
offset 
offset 


offset 


offset 
offset 


offset 
offset 


85h 
86h 
87h 
88h 
89h 
8Ah 
8Bh 
8ch 
8Dh 
8Eh 
8Fh 
90h 


A0Oh 
Aih 
A2h 


A8h 
A9h 
AAh 
ABh 
Ach 
ADh 


Boh 
Blh 
B2h 
B3h 
Bah 
B5h 
B6h 


coh 
Cih 
C2h 
C4h 


csh 
CBh 


CcCh 
CEh 
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ATIM64.H 


ATIM64.H 


ATIM64.H 
C include file that contains ATI MACH64 register definitions 


Copyright (c) 1993 ATI Technologies Inc. All rights reserved 


SSE Sess SSeS Ses eee aS SSR SSS SSE SEES SES SESS SSE SEBS STE SESE SSeS sss / 


/* NON-GUI IO MAPPED Registers */ 


#define iLoCRTC_H TOTAL _DISP 0x02EC 
#define LoCRTC_H_SYNC_STRT_WID Ox06EC 
#define 1oCRTC_V_TOTAL_DISP Ox0AEC 


#define LoCRTC_V_SYNC_STRT_WID Ox0EEC 
#define 1LoCRTC_VLINE_CRNT_VLINE 0x12EC 


© 1993 ATI Technologies Inc. 
Proprietary and Confidential 


#define i1oCRTC_OFF_ PITCH Ox16EC 
#define ioCRTC_INT CNTL Ox1AEC 
#define iLoCRTC_GEN_CNTL Ox1EEC 
#define iLoOVR_CLR Ox22EC 
#define 1LoOVR_WID_LEFT_RIGHT Ox26EC 
#define 1LoOVR_WID_TOP_BOTTOM Ox2AEC 
#define iLoCUR_CLRO Ox2EEC 
#define iLoCUR_CLR1 Ox32EC 
#define LoCUR_OFFSET 0x36EC 
#define 1LoCUR_HORZ_VERT_POSN Ox3AEC 
#define LoCUR_HORZ_VERT_OFF Ox3EEC 
#define 1LoSCRATCH REGO 0Ox42EC 
#define 1LoSCRATCH_REG1 Ox46EC 
#define iLoCLOCK_SEL_CNTL Ox4AEC 
#define iLoBUS_CNTL Ox4EEC 
#define ioMEM CNTL 0x52EC 
#define LOoMEM_VGA_WP_SEL Ox56EC 
#define iLoMEM_VGA_RP_ SEL Ox5AEC 
#define LoDAC_REGS Ox5EEC 
#define 1LoDAC_CNTL 0x62EC 
#define 1oGEN_TEST_CNTL Ox66EC 
#define LoCONFIG_CNTL Ox6AEC 
#define iLoCONFIG_CHIP_ID Ox6EEC 
#define iLoCONFIG_ STATO 0x72EC 
#define LoCONFIG_STAT1 0x76EC 
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ATIM64.H 


/* NON-GUI MEMORY MAPPED Registers - expressed in BYTE offsets */ 


#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 


#define 
#define 
#define 


#define 
#define 
#define 
#define 
#define 


#define 
#define 


#define 


#define 


#define 


#define 
#define 


#define 
#tdefine 


#define 
#define 


#define 
#define 


CRTC_H_TOTAL_DISP 


CRTC_H_ SYNC_STRT_WID 


CRTC_V_TOTAL_DISP 


CRTC_V_SYNC_STRT_WID 
CRTC_VLINE_CRNT_VLINE 


CRTC_OFF_PITCH 
CRTC_INT_CNTL 
CRTC_GEN_CNTL 


OVR_CLR 
OVR_WID_LEFT_RIGHT 
OVR_WID_TOP_BOTTOM 


CUR_CLRO 

CUR_CLR1 
CUR_OFFSET 
CUR_HORZ_VERT_POSN 
CUR_HORZ_VERT_OFF 


SCRATCH_REGO 
SCRATCH_REG1 


CLOCK_SEL_CNTL 


BUS_CNTL 


MEM_CNTL 


MEM_VGA_WP_SEL 
MEM _VGA_RP_SEL 


DAC_REGS 
DAC_CNTL 


GEN_TEST_CNTL 
CONFIG _CHIP_ID 


CONFIG_STATO 
CONFIG _STAT1 


0x0000 
0x0004 
0x0008 
0x000C 
0x0010 
0x0014 
0x0018 
0x001C 


0x0040 
0x0044 
0x0048 


0x0060 
0x0064 
0x0068 
0x006C 
0x0070 


0x0080 
0x0084 


0x0090 


Ox00A0 


0x00B0 


Ox00B4 
Ox00B8 


0x00C0 
0x00c4 


0x00D0 
0x00E0 


Ox00E4 
Ox00E8 


/* GUI MEMORY MAPPED Registers */ 


#define 
#define 
#define 
#define 
#define 
#define 


DST_OFF_PITCH 
DST_xX 

DST_Y 
DST_Y_xX 
DST_WIDTH 
DST_HEIGHT 
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0x0100 
0x0104 
0x0108 
0x010C 
0x0110 
0x0114 


// 
// 
// 
// 
// 
// 
// 
// 


// 
// 
// 


// 
// 
// 
// 
// 
// 


Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 


Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 


Dword 
Dword 


Dword 


Dword 


Dword 


Dword 
Dword 


Dword 
Dword 


Dword 


Dword 
Dword 


‘Dword 


Dword 
Dword 
Dword 
Dword 
Dword 
Dword 


offset 
offset 
offset 
offset 
offset 
offset 
offset 
offset 


offset 
offset 
offset 


offset 
offset 
offset 
offset 
offset 


offset 
offset 


offset 


offset 


offset 


offset 
offset 


offset 
offset 


offset 


offset 
offset 
offset 


offset 
offset 
offset 
offset 
offset 
offset 


00 
01 
02 
03 
04 
05 
06 
07 


10 
11 
12 


18 
19 
1A 
1B 
ic 


20 
21 


24 


28 


2c 


2D 
2E 


30 
31 


34 


38 
39 
3A 


40 
41 
42 
43 
44 
45 
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#define DST_HEIGHT WIDTH 0x0118 // Dword offset 46 
#define DST_X WIDTH 0x011C // Dword offset 47 
#define DST_BRES_LNTH 0x0120 // Dword offset 48 
#define DST_BRES_ERR 0x0124 // Dword offset 49 
#define DST_BRES_INC 0x0128 // Dword offset 4A 
#define DST_BRES_DEC 0x012C // Dword offset 4B 
#define DST_CNTL 0x0130 // Dword offset 4C 
#define SRC_OFF_PITCH 0x0180 // Dword offset 60 
#define SRC_X 0x0184 // Dword offset 61 
#define SRC_Y 0x0188 // Dword offset 62 
#define SRC_Y_X 0x018sc // Dword offset 63 
#define SRC_WIDTH1 0x0190 // Dword offset 64 
#define SRC_HEIGHT1 0x0194 // Dword offset 65 
#define SRC_HEIGHT1_WIDTH1 0x0198 // Dword offset 66 
#define SRC_X_ START 0x019C // Dword offset 67 
#define SRC_Y_START Ox01A0 // Dword offset 68 
#define SRC_Y_X START Ox01A4 // Dword offset 69 
#define SRC_WIDTH2 Ox01A8 // Dword offset 6A 
#define SRC_HEIGHT2 Ox01AC // Dword offset 6B 
#define SRC_HEIGHT2 WIDTH2 0x01B0 // Dword offset 6C 
#define SRC_CNTL 0x01B4 // Dword offset 6D 
#define HOST _DATAO 0x0200 // Dword offset 80 
#define HOST _DATA1 0x0204 // Dword offset 81 
#define HOST_DATA2 0x0208 // Dword offset 82 
#define HOST _DATA3 0x020C // Dword offset 83 
#define HOST DATA4 0x0210 // Dword offset 84 
#define HOST _DATAS 0x0214 // Dword offset 85 
#define HOST DATA6 0x0218 // Dword offset 86 
#define HOST_DATA7 0x021C // Dword offset 87 
#define HOST_DATAS 0x0220 // Dword offset 88 
#define HOST_DATA9 0x0224 // Dword offset 89 
#define HOST _DATAA 0x0228 // Dword offset 8A 
#define HOST DATAB 0x022C // Dword offset 8B 
#define HOST _DATAC 0x0230 // Dword offset 8c 
#define HOST DATAD 0x0234 // Dword offset 8D 
#define HOST_DATAE 0x0238 // Dword offset 8E 
#define HOST DATAF 0x023C // Dword offset 8F 
#define HOST_CNTL 0x0240 // Dword offset 90 
#define PAT REGO 0x0280 // Dword offset AO 
#define PAT_REG1 0x0284 // Dword offset Al 
#define PAT_CNTL 0x0288 // Dword offset A2 
#define SC_LEFT Ox02A0 // Dword offset A8 
#define SC_RIGHT 0x02A4 // Dword offset A9 
#define SC_LEFT RIGHT O0x02A8 // Dword offset AA 
#define SC_TOP Ox02AC // Dword offset AB 
#define SC_BOTTOM 0x02B0 // Dword offset AC 
#define SC_TOP_BOTTOM 0x02B4 // Dword offset AD 
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#define 
#define 
#define 
#define 
#define 
#define 
#define 


#define 
#define 
#define 
#define 


#define 
#define 


#define 
#define 


DP_BKGD_CLR 
DP_FRGD_CLR 
DP_WRITE_MASK 
DP_CHAIN MASK 
DP_PIX WIDTH 
DP_MIX 

DP_SRC 


CLR_CMP_CLR 
CLR_CMP_MASK 
CLR_CMP_CNTL 
FIFO_STAT 


CONTEXT_MASK 
CONTEXT_LOAD_CNTL 


GUI_TRAJ_CNTL 
GUI_STAT 
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0x02C0 
0x02C4 
0x02Cc8 
0x02cCc 
0x02D0 
0x02D4 
0x02D8 


0x0360 
0x0304 
0x0308 
0x0310 


0x0320 
0x032C 


0x0330 


0x0338. 


// 
// 


Dword 
Dword 
Dword 
Dword 
Dword 
Dword 
Dword 


Dword 
Dword 
Dword 
Dword 


Dword 
Dword 


Dword 
Dword 


offset 
offset 
offset 
offset 
offset 
offset 
offset 


offset 
offset 
offset 


offset 


offset 
offset 


offset 
offset 


BO 
Bl 
B2 
B3 
B4 
B5 
B6 - 


co 
Cl 
C2 
C4 


cs 
CB 


cc 
CE 
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Commonly used MACH 64 draw functions such as filled rectangles and lines. 


Copyright (c) 1993 ATI Technologies Inc. All rights reserved 


SES eSB Se SS SSS SESS ESSE SSSR ESS SSE SSE RSE RE SSS SEES SSS SS SS SSS sss ssssssssssses=*/ 
#include <stdio.h> 
#include <stdlib.h> 
#include <string.h> 
#include <conio.h> 
#include <dos.h> 
#include “atim64.h” 
#include “sample.h” 
f*. eiieceeccecoeweeu stl oe ceed eee e cee cede ne ee ae eee = ctf sce aiken ae eee Soeae 
GET COLOR_CODE - return the color code for the given generic color. 
A “generic” color value is converted into a real color value for sample 
functions set_fg_color() and get_fg_color() to make color setting 
video mode independant. Typically, the returned value is used as the 
input value to set_fg_color() or set_bg_color(). 
ee ne eee ee eee a ee ee ee */ 
unsigned long get_color_code(int generic_color) 
{ 
unsigned long color; 
color = BLACK; 
switch (generic_color) 
{ 
case BLACK: 
color = BLACK; 
break; 
case DARKBLUE: 
if ((modeinfo.bpp == 4) || (modeinfo.bpp == 8)) 
{ 
color = DARKBLUES8; 
} 
if ((modeinfo.bpp == 16) && (modeinfo.depth == 555)) 
{ 
color = DARKBLUEI15; 
} 
1£ ((modeinfo.bpp == 16) && (modeinfo.depth == 565)) 
{ 
color = DARKBLUE16;. 
} 
© 1993 ATI Technologies Inc. PRG888GX0-01 
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if (modeinfo.bpp == 24) 
{ 
color = DARKBLUE24 RGB; 
} 
if (modeinfo.bpp == 32) 
{ 


color = DARKBLUE32_RGBA; 


} 
break; 


case DARKGREEN: 
if ((modeinfo.bpp == 4) || 
{ 
color = DARKGREENS8; 


1£ ((modeinfo.bpp == 16) && (modeinfo.depth 


color = DARKGREENIL5; 


1f ((modeinfo.bpp == 16) && (modeinfo.depth 


color = DARKGREEN16; 


if (modeinfo.bpp == 24) 


(modeinfo.bpp == 


color = DARKGREEN24 RGB; 


if (modeinfo.bpp == 32) 


color = DARKGREEN32_RGBA; 


} 


break; 
case DARKCYAN: 

if ((modeinfo.bpp == 4) || 

color = DARKCYANS; 

if ((modeinfo.bpp == 16) && 
color = DARKCYAN15; 

1f ((modeinfo.bpp == 16) && 
color = DARKCYAN16; 

if (modeinfo.bpp == 24) 


color = DARKCYAN24_ RGB; 


1f (modeinfo.bpp == 32) 


PRG888GX0-01 
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(modeinfo.bpp == 


(modeinfo.depth 


(modeinfo.depth 


8)) 


== 555)) 


== 565) ) 


8)) 


== 555)) 


565) ) 
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color = DARKCYAN32_RGBA; 
} 
break; 


case DARKRED: 
if ((modeinfo.bpp == 4) I| (modeinfo.bpp == 8) ) 


{ 
color = DARKREDS8; 


} 
1f ((modeinfo.bpp == 16) && (modeinfo.depth == 555)) 


{ 
color = DARKRED15; 


} 
if ((modeinfo.bpp == 16) && (modeinfo.depth == 565)) 


{ 
color = DARKRED16; 


ie (modeinfo.bpp == 24) 
color = DARKRED24_ RGB; 
if (modeinfo.bpp == 32) 
color = DARKRED32_RGBA; 
} 
break; 
case DARKMAGENTA: 
if ((modeinfo.bpp == 4) || (modeinfo.bpp == 8) ) 
color = DARKMAGENTAS; 
Lf ((modeinfo.bpp == 16) && (modeinfo.depth == 555)) 
color = DARKMAGENTAIL5S; 
1£ ((modeinfo.bpp == 16) && (modeinfo.depth == 565)) 
color = DARKMAGENTAI6; 
if (modeinfo.bpp == 24) 
color = DARKMAGENTA24_RGB; 
if (modeinfo.bpp == 32) 
color = DARKMAGENTA32_RGBA; 
} 


break; 


case BROWN: 
1f ((modeinfo.bpp == 4) |! (modeinfo.bpp == 8)) 


© 1993 ATI Technologies Inc. | PRG888GX0-01 
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color = BROWNS; 
1f ((modeinfo.bpp == 16) && (modeinfo.depth == 555)) 
color = BROWN15; 
if ((modeinfo.bpp == 16) && (modeinfo.depth == 565)) 
color = BROWN16; 
if (modeinfo.bpp <== 24) 
color = BROWN24_RGB; 
if (modeinfo.bpp == 32) 
Ssise = BROWN32_RGBA; 
} 
break; 
case LIGHTGRAY: 
if ((modeinfo.bpp == 4) [| (modeinfo.bpp == 8)) 
color = LIGHTGRAYS8; 
if ((modeinfo.bpp == 16) && (modeinfo.depth == 555)) 
color = LIGHTGRAY15; 
if ((modeinfo.bpp == 16) && (modeinfo.depth == 565)) 
color = LIGHTGRAY16; 
1f (modeinfo.bpp == 24) 
color = LIGHTGRAY24_RGB; 
if (modeinfo.bpp == 32) 
color = LIGHTGRAY32_RGBA; 
} 
break; 
case DARKGRAY: 
if ((modeinfo.bpp == 4) || (modeinfo.bpp == 8)) 
color = DARKGRAY8; 
i ((modeinfo.bpp == 16) && (modeinfo.depth == 555)) 


{ 
color = DARKGRAY15; 


PRG888GXO0-01 © 1993 ATI Technologies Inc. 
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a ((modeinfo.bpp == 16) && (modeinfo.depth 
color = DARKGRAY16; 
e (modeinfo.bpp == 24) 
color = DARKGRAY24_RGB; 
ee (modeinfo.bpp == 32) 
color = DARKGRAY32_RGBA; 
} 


break; 


case LIGHTBLUE: 
if ((modeinfo.bpp <== 4) 
{ 


|| (modeinfo.bpp == 


color = LIGHTBLUES8; 


if ((modeinfo.bpp == 16) && (modeinfo.depth 
color = LIGHTBLUE15; 

- ((modeinfo.bpp == 16) && (modeinfo.depth 
color = LIGHTBLUE16; 

ie (modeinfo.bpp == 24) 

color = LIGHTBLUE24_RGB; 

- (modeinfo.bpp == 32) 

color = LIGHTBLUE32_RGBA; 

} 

break; 


case LIGHTGREEN: 
if ((modeinfo.bpp == 4) 
{ 


11 (modeinfo.bpp == 


color = LIGHTGREENS8; 
} 
if ((modeinfo.bpp == 16) && 
{ 


(modeinfo.depth 
color = LIGHTGREENI15; 

} 

if ((modeinfo.bpp == 16) && 
{ 


(modeinfo.depth 
color = LIGHTGREEN16; 


} 
if (modeinfo.bpp == 24) 
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== 565)) 


8)) 


555) ) 


565) ) 


8)) 


z= 555)) 


565)) 
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color = LIGHTGREEN24_ RGB; 
} 
1f (modeinfo.bpp == 32) 
{ 

color = LIGHTGREEN32_RGBA; 


ore 
case LIGHTCYAN: 

if ((modeinfo.bpp == 4) || (modeinfo.bpp == 8) ) 

color = LIGHTCYANS; 

if ((modeinfo.bpp == 16) && (modeinfo.depth == 555)) 
color = LIGHTCYAN15; 

if ((modeinfo.bpp == 16) && (modeinfo.depth == 565) ) 
color = LIGHTCYAN16; 

if (modeinfo.bpp == 24) 
color = LIGHTCYAN24_RGB; 

if (modeinfo.bpp == 32) 
color = LIGHTCYAN32_RGBA; 

} 

break; 

case LIGHTRED: 

if ((modeinfo.bpp == 4) || (modeinfo.bpp == 8)) 

color = LIGHTRED8; 

1£ ((modeinfo.bpp == 16) && (modeinfo.depth == 555)) 
color = LIGHTRED15; 

1f ((modeinfo.bpp == 16) && (modeinfo.depth == 565)) 
color = LIGHTREDI16; 

1f (modeinfo.bpp == 24) 
color = LIGHTRED24_RGB; 


L£ (modeinfo.bpp == 32) 


color = LIGHTRED32_RGBA; 
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} 
break; 


case LIGHTMAGENTA: 

if ((modeinfo.bpp == 4) || (modeinfo.bpp == 8)) 

color = LIGHTMAGENTAS; 

if ((modeinfo.bpp == 16) && (modeinfo.depth == 555)) 
color = LIGHTMAGENTA15; 

if ((modeinfo.bpp == 16) && (modeinfo.depth == 565) ) 
color = LIGHTMAGENTA16; 

if (modeinfo.bpp == 24) 
color = LIGHTMAGENTA24_RGB; 

if (modeinfo.bpp == 32) 
color = LIGHTMAGENTA32_RGBA; 

} 

break; 

case YELLOW: 

if ((modeinfo.bpp == 4) I| (modeinfo.bpp == 8)) 

color = YELLOW8; 

if ((modeinfo.bpp == 16) && (modeinfo.depth == 555)) 
color = YELLOW15; 

if ((modeinfo.bpp == 16) && (modeinfo.depth == 565)) 
color = YELLOW16; 

if (modeinfo.bpp == 24) 
color = YELLOW24_RGB; 

1f (modeinfo.bpp == 32) 
color = YELLOW32_RGBA; 

} 


break; 


case WHITE: 


1f ((modeinfo.bpp == 4) I! (modeinfo.bpp == 8)) 
{ 
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color = WHITES; 


} 
1f ((modeinfo.bpp == 16) && (modeinfo.depth == 555)) 
{ 
color = WHITE15; 
} 
if ((modeinfo.bpp == 16) && (modeinfo.depth == 565)) 
{ 
color = WHITE16; 
} 
if (modeinfo.bpp == 24) 
{ 
color = WHITE24_RGB; 
} 
if (modeinfo.bpp == 32) 
{ 
color = WHITE32_RGBA; 
} 
break; 
} 
return (color); 
} 
/ Ca ee 
SET FG COLOR - set foreground color 
i i i si my, a i. iti. ss er: nen ek ein is ik iain sii, Si is: sis: nit: i: i is: i: ti, ai: es. Si. ti Se, m:n "ii, * / 
void set_fg_ color(unsigned long color) 
{ 
wait_for_fifo(1); 
regw(DP_FRGD_CLR, color); 
} 
/ Fee cere eee ee wee re me oe es ee cre ee ee ee oe ne eee ee ee ee eee mm eee ee eee cae ee ee a eee cr ee eee eee ee we 
GET _ FG COLOR - get foreground color 
enn en ee ee ee * / 
unsigned long get_fg_color(void) 
{ 
wait _for_idle(); // insure engine is idle before reading a GUI register 
return (regr(DP_FRGD_CLR) ); 
} 
/ 5, nee ee ee ee re en ee ne ee ee 
SET _BG_ COLOR - set background color 
ee en ee ed */ 
void set_bg_ color(unsigned long color) 
{ 
wait_for_fifo(1); 
regw(DP_BKGD_CLR, color); 
} 
/ FO rr ee ee ee oe ee ee ee eee ee eee eee sa cme cer ee eee eee ee ae ee ee ee eee te tt ee 
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GET_BG COLOR - get background color 


Ee en ee a ee en ee ne ee ea ee */ 

unsigned long get_bg_color(void) 

{ 
wait_for_idle(); // insure engine is idle before reading a GUI register 
return (regr(DP_BKGD_CLR) ); 

} 

/* jt San i eas ahem; Gus <b <n myn’ gh Sl mem eb" ip, alban“ Gp ses’ hess es och fess ems “ima eae um’ gpms GU in Gu cam “enin “sles SS Gel (NM, mn n'a “Ol tnd“ 

SET_FG MIX - set the current foreground mix 

oo ee ee ee ee eee eee nese */ 

void set_fg_mix(int mix) 

{ 
unsigned long temp; 
wait_for_idle(); // insure engine is idle before reading a GUI register 
temp = (unsigned long) (mix) ; 
temp = temp << 16; 
temp = (unsigned long) (temp | (regr(DP_MIX) & Oxffff)); 
regw(DP_MIX, temp); 

} 

/* Se: iS sh skint, c-Si“ wm sl iy <i: mp: a b,c‘ ‘am leit ek i Sg: mi em“ ets’. al: 

GET _FG MIX - get the current foreground mix 

a aes! OS ic es ss a cs ems srw ce Sn awe ee ew ce J nce seen ese? ms Tes Sein {Gs too wl Tees” ech Sia tne eine ‘Seo Sie; oss“ | ss <n “soe Sh ce et */ 

int get_fg_mix(void) 

{ 
wait_for_idle(); // insure engine is idle before reading a GUI register 
return ((int) (regr(DP_MIX) >> 16)); 

} 

/* oe nnn ren ee ee 

SET BG MIX - set the current background mix 

tas sim cas ute Ginn om GED UD es ets MD GD GES RL es Mm a ae cu ney ate nD es ee ee Oe ee me EO ae a OD coe SD OS tes meee ne ese See tate me os oe coe ee oe se ee */ 

void set_bg mix(int mix) 

{ 
unsigned long temp; 
wait_for_idle(); // insure engine is idle before reading a GUI register 
temp = regr(DP_MIX) & Oxffff0000; 
regw(DP_MIX, (unsigned long) (temp | mix) ); 

} 

/* a en anne a ee ee ee 

GET_BG MIX - get the current background mix 

ee ee eee nee eae nee, ete sme es cs fete ce cry SOO cum me ens ess se me See ome ce me nts eee es em Gee ae DS Oe Oe eS ee ee ee oe ee ee oe Oe ee ee ee */ 

int get_bg mix(void) 

{ 
wait _for_idle(); // insure engine is idle before reading a GUI register 
return ((int) (regr(DP_MIX) & Oxffff)); 

} 
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/* ee me ee em eee a ee woe ae > OS OD Om wee wet OH OP OD OD OD OD ms mm Om me > Gm OD ce me me mew om me ee on ee es ws 
get_primary_color - separate the primary color component from a given 
color (i.e. separate the RED, GREEN, or BLUE 
component value 


This function is only useful for non-palettized modes such as 15, 16, 24, 
and 32 bpp color depths. If this function is called for 4 or 8 bpp modes, 
the index value will be returned without modification. 


a ee ee ee ee * / 
unsigned long get_primary_color(PRIMARYCOLOR primarycolor, 
unsigned long color) 
{ 
unsigned long primary; 
switch (modeinfo.bpp) 
{ 
case 4: 
case 8: 
primary = color; 
break; 
case 16: 
1f (modeinfo.depth == 555) 
{ 
switch (primarycolor) 
{ 
case RED: primary = (color >> 10) & Oxlf; break; 
case GREEN: primary = (color >> 5) & Ox1lf; break; 
case BLUE: primary = color & Ox1f; break; 
} 
} 
else // 565 weight 
{ 
switch (primarycolor) 
{ 
case RED: primary = (color >> 11) & Ox1lf; break; 
case GREEN: primary = (color >> 5) & Ox3f; break; 
case BLUE: primary = color & Oxif; break; 
} 
} 
break; 
case 24: 
switch (primarycolor) 
{ 
case RED: primary = (color >> 16) & Oxff; break; 
case GREEN: primary = (color >> 8) & Oxff; break; 
case BLUE: primary = color & Oxff; break; 
} 
break; 
case 32: 
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switch (primarycolor) 


{ 
case RED: primary = (color >> 24) & Oxff; break; 
case GREEN: primary = (color >> 16) & Oxff; break; 
case BLUE: primary = (color >> 8) & Oxff; break; 
} 

break; 


} 


return (primary) ; 
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Functions to define, enable, disable, and position the Mach64 hardware 


cursor. 


Copyright (c) 1993 ATI Technologies Inc. All rights reserved 


#include <stdlib.h> 
#include <stdio.h> 

#include <string.h> 
#include <conio.h> 

#include <dos.h> 


#include “atim64.h” 
#Hinclude “sample.h” 


SET _HWCURSOR - define hardware cursor bitmap 


gaseeesssesessecsssszet / 


This function sets up the hardware cursor data region according to the 
given bitmap data. The data region is located at (0, y). 


The hardware cursor be vary in size from 1x1 to 64x64. The expected bitmap 
format is LSB to MSB. The LSB will be drawn first in a left to right 
direction. Since the cursor position is NOT set in this routine, the 
position should be set (set_hwcursor_pos) prior to enabling the cursor. 


void set_hwcursor(int y, int width, int height, 


unsigned long color0, unsigned long colori, 


unsigned int *bitmap) 


unsigned long cur_offset, cur_size, cur_pitch; 
unsigned long templ, temp2, temp3, temp4, temp5, 
unsigned long redi, greeni, bluel, red2, green2, 
unsigned long rshift, gshift, bshift; 

unsigned long bitdata; 

int i, index, dataindex, start, widthwords; 
PALETTE entry; 


// Check that cursor size is within limits 
1f ((width < 1) II (width > 64)) return; 
if ((height < 1) II (height > 64)) return; 


// set cursor dimensions 
cursordata.y = yi 
cursordata.width = width; 
cursordata.height = height; 
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temp6; 
blue2; 
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cursordata.color0 = color0; 
cursordata.colorl = colorl; 


// set hwcursor bitmap to transparent 
for (index = 0; index < (HWCURHEIGHT * HWCURWIDTH); index++) 
{ 
cursordata.bitmap[index] = Oxaaaa; 
} 


// load user hwcursor data into bitmap 
dataindex = 0; 
widthwords = width / 8; 
if (width > widthwords * 8) 
{ 
widthwords++; 
} 
start = HWCURWIDTH - widthwords; 


for (index = start; index < (HWCURWIDTH * height); index = index + HWCURWIDTH) 


cursordata.bitmap [index + i] = *(bitmap + dataindex); 
if (width < 8) 
{ 


cursordata.bitmap[index + i] = cursordata.bitmap[index + i] << 


((8 - width) * 2); 
} 
dataindex++; 
i++; 
} while (i < widthwords) ; 
} 


// calculate offset in bytes 
cur_offset = (unsigned long) (modeinfo.pitch); 
1f (modeinfo.bpp == 4) 
{ 
cur_offset = (unsigned long) (cur_offset / 2); 
} 
else 
{ 


cur_offset (unsigned long) (cur_offset * (modeinfo.bpp / 8)); 


} 
cur_offset = (unsigned long) (cur_offset * y); 


// calculate cursor pitch (assuming 16 bpp) 
cur_pitch = HWCURWIDTH / 8; 
cur_pitch = cur_pitch << 22; 


// convert byte offset to qword offset and limit to 20 bits (used DWORD 


// format used in DST_OFF_PITCH 
cur_offset = (cur_offset / 8) & Ox000fffff; 
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// Use 16 bpp to setup hardware cursor bitmap 


// save vital registers 
wait_for_idle(); 

templ = regr(DP_PIX_WIDTH) ; 
temp2 = regr(DP_CHAIN_ MASK) ; 
temp3 = regr(DST_OFF_PITCH) ; 
temp4 = regr(DP_SRC); 

temp5 = regr(DP_MIX); 

temp6 = regr(DST_CNTL) ; 


// load bitmap data to hardware cursor bitmap data area 
regw(DP_PIX WIDTH, (templ & Oxffc00000) | BYTE_ORDER_LSB_TO_MSB | 
HOST_16BPP | SRC_16BPP | DST_16BPP); 
regw(DP_CHAIN MASK, 0x0410); // chain mask for 16 bpp 
regw(DST_OFF_PITCH, cur_pitch | cur_offset); 
regw(DP_SRC, FRGD_SRC_HOST); 
regw(DP_MIX, FRGD_MIX_S | BKGD_MIX_D); 
regw(DST_CNTL, DST_Y_TOP_TO_ BOTTOM | DST_X_LEFT_TO_RIGHT) ; 
regw(DST_X, 0); 
regw(DST_Y, 0); 
regw(DST_HEIGHT, HWCURHEIGHT) ; 
regw(DST_WIDTH, HWCURWIDTH) ; 
for (index = 0; index < (HWCURHEIGHT * HWCURWIDTH * 2); index = index + 2) 
{ 
wait _for_fifo(1); 
bitdata = (unsigned long) (cursordata.bitmap [index+1] ); 
bitdata = (bitdata << 16) | (cursordata.bitmap [index] ); 
regw(HOST DATAO, bitdata); 
} 
wait _for_idle(); 


// set cursor size offsets 

cur_size = (unsigned long) (64 - height); 

cur_size = (unsigned long) ((cur_size << 16) | (64 - width) ); 
regw(CUR_HORZ _VERT_OFF, cur_size); 


// set cursor colors 
if (modeinfo.bpp > 8) 
{ 
// for 15, 16, 24, 32 color modes 
switch (modeinfo.bpp) 
{ 
case 16: 
if (modeinfo.depth == 555) 
{ 
rshift = 
gshift 3; 
bshift = 3; 


I 
WwW 
we 


} 
else // 565 weight 
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rshift = 3; 
gshift = 2; 
bshift = 3; 
} 
break; 


case 24: 

case 32: 
rshift = 0; 
gshift = 
bshift = 0; 
break; 


I 
Oo 
we 


} 


// cursor color 0 

red1l = get_primary_color(RED, color0) << rshift; 
greenl = get_primary_color(GREEN, color0) << gshift; 
bluel = get_primary_color(BLUE, color0) << bshift; 


// cursor color 1 

red2 = get_primary_color(RED, colorl) << rshift; 
green2 = get_primary_color(GREEN, colorl) << gshift; 
blue2 = get_primary_color(BLUE, colorl) << bshift; 


if (querydata.dac_type != DAC_ATI68860) 

{ 
// standard setup for other dacs 
regw(CUR_CLRO, (red1l << 24) | (greenl << 16) | (bluel << 8)); 
regw(CUR_CLR1, (red2 << 24) | (green2 << 16) | (blue2 << 8)); 


} 
else 
{ 
// for 4, 8 bpp color modes 
if (querydata.dac_type == DAC_ATI68860) 
{ 
entry = get_palette((int) (color0O & Oxff)); 
red1 = (unsigned long) (entry.red); 
greenl = (unsigned long) (entry.green) ; 
bluei = (unsigned long) (entry.blue) ; 
entry = get_palette((int) (colorl & Oxff)); 
red2 = (unsigned long) (entry.red) ; 
green2 = (unsigned long) (entry.green) ; 
blue2 = (unsigned long) (entry.blue) ; 
} 
else 
{ 
regw(CUR_CLRO, color0O & Oxff); 
regw(CUR_CLR1, colorl & Oxff); 
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if (querydata.dac_type == DAC_ATI68860) 

{ 
// special setup for ATI68860/880 dac for cursor colors 
outp(LoDAC_CNTL, 1); 
outp(1oDAC_REGS, 0); 


outp (1LODAC_REGS+1, (int) red1); 
outp(LoDAC_REGS+1, (int) green1) ; 
outp(LoDAC_REGS+1, (int)bluel); 


outp(ioDAC_REGS+1, (int) red2); 
outp(1LoDAC_REGS+1, (int) greenz2) ; 
outp (LoDAC_REGS+1, (int)blue2); 


outp(LoDAC_CNTL, 0); 
} 


// set offset to cursor data region 
regw(CUR_OFFSET, cur_offset) ; 


// restore vital registers 
regw(DP_PIX WIDTH, temp1); 
regw(DP_CHAIN MASK, temp2); 
regw(DST_OFF_PITCH, temp3); 
regw(DP_SRC, temp4); 
regw(DP_MIX, temp5) ; 
regw(DST_CNTL, temp6); 


} 
/* ee eee ee ce eee eee me eee me ee ee ea eee esa se cee et ote ae ie ome neh mmm cee etm te ete cee ee ee ee eee 
ENABLE HWCURSOR - turn on the hardware cursor 
“co and ess ok tein a eu: tess oul es ems a "ems Gv fue cum ie Sn ta in uns: as ees eee Jim ces ‘0s ‘nem, te Ge ce, es vow a "Gnd Gad tn, So co wk to wet ea sows, ew eo Oe x / 
void enable _hwcursor (void) 
{ 
// enable hardware cursor 
outpw(ioGEN_TEST CNTL, inpw(ioGEN_TEST_CNTL) | 0x80); 
} 
[4 cee wee ee wee ewer eae aa a ee ee ee eee ee eee en en en ee ee eee ne 
DISABLE _HWCURSOR - turn off the hardware cursor 
eae tas mee we tees ROY an Uae OG GR Ge uD OED GD GES tems GED OS Gd GE OW OE OED ED Om GD OD WD OD GD Mm OD me ma Gm Oto Ons Oa Mee OD OS Oe OS Oe Oe ee tn ee Oe OS Os ee oe ie ets we ti ne “ees eee ce */ 
void disable_hwcursor (void) 
{ 
// @isable hardware cursor 
outpw(ioGEN_TEST CNTL, inpw(ioGEN_TEST_CNTL) & Oxff7f); 
} 
/* fee uae Se ee Ge OF GH GD GR Om GD ED OD UD ED OD GD Om OS Om fem om oe Om OD Om Om Oe we OD oe oD om om om om oo om oe om om om me em em om om om i om On mm me me sm me me mw es ee 
SET_HWCURSOR_POS - set the hardware cursor position 
ee we ee ee me om we oe ae On ee nD GD OD GE OD OS OS OD oe OD OD Om Oe Oe ee Oe ee > Ge Oe es os Oe em oe om om em me en ee ee em oe ne we se es ee se */ 
void set_hwcursor_pos(int x, int y) 
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/* 


unsigned long cur_pos; 


// check for coordinate violations 
1f (x < 0) x = O; 
if (y < 0) y = O; 


// set cursor position 
cur_pos = (unsigned long)y; 


cur_pos = (unsigned long) ((cur_pos << 16) 


regw(CUR_HORZ_VERT_POSN, cur_pos) ; 


vold get_hwcursor_pos(POINT *position) 


{ 


unsigned long cur_pos; 


// get cursor position 


cur_pos = regr(CUR_HORZ_VERT_POSN) ; 


position->x 


(int) (cur_pos & Oxffff); 


HWCURSOR.C 


[| x)? 


position->oy = (int) ((cur_pos >> 16) & Oxffff); 


© 1993 ATI Technologies Inc. 
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GLOB.C 


Global structures for sample code. 


Copyright (c) 1993 ATI Technologies Inc. All rights 


reserved 


SPSS SSeS SSeS SSS SS SSS SSS SSS S RSS HSS SSS SESS SS ESSE ES SS SESS SSE SSS Ses sSeSsssssee=* / 


#include 
#include 
#include 
#include 
#include 


#include 
#include 


/* Global structure declarations */ 


<stdio.h> 
<stdlib.h> 
<string.h> 
<conio.h> 
<dos.h> 


“atim64.h” 
“sample.h” 


MODECFG modeinfo; 


HWCURSOR cursordata; 
QUERY_STRUCTURE querydata; 


int mem_cntl; 
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SAMPLE.H 


SAMPLE .H 
Cc include file for sample functions 


Copyright (c) 1993-94 ATI Technologies Inc. All rights reserved 


SOE SERB BRE SSS Se EH SEES SESH HSS SEE BES SEES ESE ESSE SSS SSS SSeS Sess esses =* / 
/* Function constants */ 

// BIOS interface constants 
#define MODE_640x480 0x12 
#define MODE_800x600 Ox6a 
#define MODE _ 1024x768 ox55 
#define MODE_1280x1024 0x83 
#define PITCH 1024 0 
#define PITCH NOCHANGE 1 
#define PITCH _XRES 2 
#define COLOR_DEPTH 4 1 
#define COLOR_DEPTH_8 2 
#define COLOR_DEPTH_15 3 
#define COLOR_DEPTH 16 4 
#define COLOR_DEPTH 24 5 
#define COLOR_DEPTH_32 6 
#define ACCELERATOR MODE 1 
#define VGA_MODE 0 
#define NO_ERROR 0 
#define YES_ERROR 1 
#define NOT SUPPORTED 2 
// Detection contants 
#define NO MACH64 0 
#define YES MACH64 1 
// Query constants 

#define HEADER_ONLY 0 
#define HEADER_AND_ MODE 1 
// Other constants 

#define MEM WRITE 1 
#define MEM READ 0 
// Waitforidle() and Waitforfifo() errors and constants 
#define BAD_IDLE 0 
#define BAD_FIFO 1 
© 1993 ATI Technologies Inc. PRG888GX0-01 
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#define DOS _TICK_ADDRESS 0x0000046c 
#define IDLE TIMEOUT 50 
#define FIFO TIMEOUT 50 


// VGA aperture address constants 

#define VGA_REGISTER_BASE 0xb000fc00 
#define LOW_APERTURE_BASE 0xa0000000 
#define HIGH APERTURE _BASE 0xa8s000000 


// Mix control values for setfgmix() and setbgmix() functions 
#define NOT_D MIX 0 


#define ZERO_MIX 1 
#define ONE_MIX 2 
#define D_MIX 3 
#define NOT_S MIX 4 
#define D_XOR_S_ MIX 5 
#define NOT_D XOR_S_ MIX 6 
#define S_MIX 7 
#define NOT_D_OR_NOT_S MIX 8 
#define D_OR_NOT_S MIX 9 
#define NOT_D_OR_S MIX 10 
#define D_OR_S MIX 1i 
#define D_AND S MIX 12 
#define NOT_D_ AND S MIX 13 
#define D_AND_NOT S MIX 14 


#define NOT_D_AND_NOT_S_MIX 15 
#define D_PLUS_S_DIV2_MIxX 23 


// Status codes used in MODECFG structure */ 
#define LINEAR_APERTURE_ENABLED 1 
#define LINEAR_APERTURE_DISABLED 0 
#define VGA_APERTURE_ENABLED 1 
#define VGA_APERTURE_DISABLED 0 


// Generic color definitions for use in getcolorcode() function 


#define BLACK 0 
#define DARKBLUE 1 
#define DARKGREEN 2 
#define DARKCYAN 3 
#define DARKRED 4 
#define DARKMAGENTA 5 
#define BROWN 6 
#define LIGHTGRAY 7 
#define DARKGRAY 8 
#define LIGHTBLUE 9 
#define LIGHTGREEN 10 
#define LIGHTCYAN 11 
#define LIGHTRED 12 
#define LIGHTMAGENTA 13 
#define YELLOW 14 
#define WHITE 15 
PRG888GX0-01 © 1993 ATI Technologies Inc. 


F-28 Proprietary and Confidential 


SAMPLE.H 


// Color codes for 4 & 8 bpp modes - palette entry based 


#define DARKBLUES 0x00000001 
#define DARKGREENS 0x00000002 
#define DARKCYANS8 0x00000003 
#define DARKRED8 0x00000004 
#define DARKMAGENTAS 0x00000005 
#define BROWNS 0x00000006 
#define LIGHTGRAYS8 0x00000007 
#define DARKGRAYS 0x00000008 
#define LIGHTBLUES 0x00000009 
#define LIGHTGREENS 0x0000000a 
#define LIGHTCYANS 0x0000000b 
#define LIGHTREDS 0x0000000c 
#define LIGHTMAGENTAS8 0x0000000d 
#define YELLOWS 0x0000000e 
#define WHITES Ox0000000F£ 
#define HIWHITES Ox000000fE 


// Color codes for 15 bpp (16 bpp 555 color weight) modes - direct mapped 


#define DARKBLUE15 0x00000014 
#define DARKGREEN15 0x00000280 
#define DARKCYANL5 0x00000294 
#define DARKRED15 0x00005000 
#define DARKMAGENTA15 0x00005014 
#define BROWN15 0x00005280 
#define LIGHTGRAY15 0x00005294 
#define DARKGRAY15 0x0000294a 
#define LIGHTBLUEL5 Ox0000001f 
#define LIGHTGREENI15 0x000003e0 
#define LIGHTCYANI15 Ox000003ff 
#define LIGHTRED15 0x00007¢c00 
#define LIGHTMAGENTA15 Ox00007c1E 
#define YELLOW15 0x00007fe0 
#define WHITE15 OxO000ffLfE 


// Color codes for 16 bpp (16 bpp 565 color weight) modes - direct mapped 


#define DARKBLUE16 0x00000014 
#define DARKGREEN16 0x000004c0 
#define DARKCYAN16 0x000004d4 
#define DARKRED16 0x0000a000 
#define DARKMAGENTA16 0x0000a014 
#define BROWN16 0x0000a4c0 
#define LIGHTGRAY16 0x0000a4d4 
#define DARKGRAY16 0x0000528a 
#define LIGHTBLUE16 ox0000001f 
#define LIGHTGREEN16 0x000007e0 
#define LIGHTCYAN16 Ox000007£F 
#define LIGHTRED16 ox0000fs00 
#define LIGHTMAGENTA16 Ox0000fs1ift 
#define YELLOW16 ox0000ffed 
#define WHITE16 OxO000fEfE 
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// Color codes for 24 bpp RGB color weight modes - direct mapped 


#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 


DARKBLUE24_RGB 
DARKGREEN2 4_RGB 
DARKCYAN24_RGB 
DARKRED2 4_RGB 
DARKMAGENTA2 4_RGB 
BROWN24_RGB 
LIGHTGRAY2 4_RGB 
DARKGRAY24_RGB 
LIGHTBLUE2 4_RGB 
LIGHTGREEN2 4_RGB 
LIGHTCYAN2 4_RGB 
LIGHTRED24_RGB 
LIGHTMAGENTA24_RGB 
YELLOW24_RGB 
WHITE2 4_RGB 


0x0000009e 
0x00009e00 
0x00009e9e 
0x009e0000 
0x009e009e 
0x009e9e00 
0x009e9e9e 
0x00555555 
OxO000000fE 
Ox0000ff00 
Ox0000fELE 
Ox00f£0000 
Ox00ffooft 
OxO00fffCLO00 
OxO00fLfLCE 


// Color codes for 32 bpp RGBA color weight modes - direct mapped 


#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 


// Macro for 24 bpp alignment - x is expressed in 24 bpp 


#define 


DARKBLUE32_RGBA 
DARKGREEN3 2_RGBA 
DARKCYAN32_RGBA 
DARKRED32_RGBA 
DARKMAGENTA32_RGBA 
BROWN32_RGBA 
LIGHTGRAY32_RGBA 
DARKGRAY32_RGBA 
LIGHTBLUE32_RGBA 
LIGHTGREEN32_RGBA 
LIGHTCYAN32_RGBA 
LIGHTRED32_RGBA 
LIGHTMAGENTA32_RGBA 
YELLOW32_RGBA 
WHITE32_RGBA 


GET24BPPROTATION (x) 


0x00009e00 
0x009e0000 
0x009e9e00 
0x9e000000 
0x9e009e00 
0x9e9e0000 
Ox9e9e9e00 
0x55555500 
Ox0000ff£00 
Ox00EFL0000 
OxO0fLEFLOO 
Oxff£000000 
OxffLooffoo 
OxfFfff0000 
Oxffffffo0o 


(unsigned long) (((x * 3) / 4) % 6) 


/* Engine bit constants - these are typically ORed together */ 


// GEN_TEST _CNTL register constants 


#define 
#define 


HWCURSOR_ENABLE 
GUI_ENGINE_ENABLE 


0x80 
0x100 


// CONFIG_CNTL register constants 


#define 
#define 


APERTURE_4M_ENABLE 
APERTURE_8M_ENABLE 


#define VGA_APERTURE_ENABLE 


1 
2 
4 


// DST_CNTL register constants 


#define 


#define 


DST_X_RIGHT_TO_LEFT 
DST_X_LEFT_TO_RIGHT 
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#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 


DST_Y_BOTTOM_TO_TOP 
DST_Y_TOP_TO_BOTTOM 
DST_X_MAJOR 
DST_Y_MAJOR 

DST_X_TILE 

DST_Y_TILE 
DST_LAST_PEL 
DST_POLYGON_ENABLE 
DST_24_ROTATION_ENABLE 


// SRC_CNTL register constants 


#define 
#define 
#define 
#define 
#define 
#define 


// HOST_CNTL register constants 


#define 


SRC_PATTERN_ENABLE 
SRC_ROTATION_ENABLE 
SRC_LINEAR_ENABLE 
SRC_BYTE_ALIGN 


or ON O 


0x10 
0x20 
0x40 
0x80 


or nd 


SRC_LINE_X RIGHT TO LEFT 0 
SRC_LINE_X LEFT TO RIGHT 0x10 


HOST_BYTE_ALIGN 


// PAT_CNTL register constants 


#define 
#define 
#define 


PAT_MONO_8x8_ENABLE 
PAT_CLR_4@x2_ENABLE 
PAT_CLR_8x1_ENABLE 


N 


// DP_PIX_WIDTH register constants 


#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 


DST_1BPP 

DST_4BPP 

DST_8BPP 

DST_15BPP 

DST_16BPP 

DST_32BPP 

SRC_1BPP 

SRC_ABPP 

SRC_8BPP 

SRC_15BPP 

SRC_16BPP 

SRC_32BPP 

HOST_1BPP 

HOST _4BPP 

HOST _SBPP 

HOST_15BPP 

HOST _16BPP 

HOST _32BPP 

BYTE _ORDER_MSB_TO_LSB 
BYTE _ORDER_LSB_TO_MSB 


// DP_MIX register constants 


#define 
#define 
#define 


BKGD_MTX_NOT_D 
BKGD_MIX_ZERO 
BKGD_MTX_ONE 


0 


ON PF WN EB 


0x100 
0x200 
0x300 
0x400 
0x600 

0 
0x10000 
0x20000 
0x30000 
0x40000 
0x60000 
0 
0x1000000 


- 


SAMPLE.H 
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#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 


// DP_SRC register constants 


#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 
#define 


// CLR_CMP_CNTL register constants 


#define 
#define 
#define 


BKGD_MIX_D 
BKGD_MIX_NOT_S 
BKGD_MIX_D_XOR_S 


BKGD_MIX_NOT_D_XOR_S 


BKGD_MTX_S 


BKGD_MTX_NOT_D_OR_NOT_S 


BKGD_MIX_D_OR_NOT_S 
BKGD_MIX_NOT_D_OR_S 


BKGD_MTX_D_OR_ 
BKGD_MTX_D_AND_S 


BKGD_MIX_NOT_D_AND_S 
BKGD_MTX_D_AND_NOT_S 
BKGD_MIX_NOT_D_AND_NOT_S 
BKGD_MIX_ D_PLUS_S_DIV2 


FRGD_MTX_NOT_D 
FRGD_MIX_ZERO 
FRGD_MIX_ONE 
FRGD_MIX_D 
FRGD_MTIX NOT_S 
FRGD_MIX _D XOR_S 


FRGD_MIX_NOT_D_XOR_S 


FRGD_MIX_S 


FRGD_MIX_NOT_D_OR_NOT_S 
FRGD_MTX_D_OR_NOT_S 
FRGD_MIX NOT_D_OR_S 


FRGD_MIX_D_OR_S 
FRGD_MIX_D_AND_S 


FRGD_MIX_NOT_D_AND_S 
FRGD_MIX_D_ AND _NOT_S 
FRGD_MIX_NOT_D_AND_NOT_S 
FRGD_MIX_D PLUS_S_ DIV2 


BKGD_SRC_BKGD_CLR 
BKGD_SRC_FRGD_CLR 
BKGD_SRC_HOST 
BKGD_SRC_BLIT 
BKGD_SRC_PATTERN 
FRGD_SRC_BKGD_CLR 
FRGD_SRC_FRGD_CLR 
FRGD_SRC_HOST 
FRGD_SRC_BLIT 
FRGD_SRC_PATTERN 
MONO_SRC_ONE 
MONO_SRC_PATTERN 
MONO_SRC_HOST 
MONO_SRC_BLIT 


COMPARE_FALSE 
COMPARE_TRUE 
COMPARE_NOT_EQUAL 
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15 

0x17 

0 
0x10000 
0x20000 
0x30000 
0x40000 
0x50000 
0x60000 
0x70000 
0x80000 
0x90000 
0xa0000 
0xb0000 
0xc0000 
0xd0000 
Oxe0000 
0xf£0000 


0x170000 


OP WNE O 


0x100 
0x200 
0x300 
0x400 

0 
0x10000 
0x20000 
0x30000 


= 
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#define 
#define 
#define 


// FIFO 
#define 


COMPARE_EQUAL 
COMPARE_DESTINATION 
COMPARE_SOURCE 


STAT register constants 
FIFO_ERR 


// CONTEXT LOAD CNTL constants 


#define 
#define 
#define 
#define 
#define 
#define 


CONTEXT _NO_LOAD 
CONTEXT_LOAD 
CONTEXT_LOAD_AND_DO_FILL 
CONTEXT_LOAD_AND_DO_LINE 
CONTEXT_EXECUTE 
CONTEXT_CMD_DISABLE 


// GUI_STAT register constants 


#define 
#define 
#define 
#define 
#define 
#define 


// Bus types from CONFIG_STATO bits 


#define 
#define 
#define 
#define 


// Dac types from CONFIG_STATO bits 


#define 
#define 
#define 
#define 
#define 
#define 
#define 


// Vga type from CONFIG_STATO bit 23 


ENGINE_IDLE 
ENGINE_BUSY 
SCISSOR_LEFT_FLAG 
SCISSOR_RIGHT_FLAG 
SCISSOR_TOP_FLAG 
SCISSOR_BOTTOM_FLAG 


BUS_ISA 
BUS_EISA 
BUS_VLB 
BUS_PCI 


DAC_TVP3020 
DAC_ATI68875 
DAC_BT476 
DAC_BT481 
DAC_ATI68860 
DAC_STG1700 
DAC_SC15021 


#define VGA_ENABLE 


#define 


VGA_DISABLE 


5 
0 


0x1000000 


0x80000000 


G 
0x10000 
0x20000 
0x30000 
0 


0x80000000 


Li:9 


SA OP WD 


1 
0 


/* Hardware cursor bitmap dimensions */ 
#define HWCURWIDTH 8 


#define 


/* Ensure that all structures are BYTE aligned -- same as /Zpl1 CL option */ 


HWCURHEIGHT 64 


#pragma pack(1) 


/* Global user structures */ 
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// width of hwcursor in words 
// height of hwcursor in scan lines 
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typedef enum {BLUE, GREEN, 


typedef struct 
{ 
int 
int width; 
int height; 
unsigned long color0; 
unsigned long colori; 


Yi 


RED} PRIMARYCOLOR; 


unsigned int bitmap[HWCURHEIGHT * HWCURWIDTH] ; 


} HWCURSOR; 


typedef struct 
{ 
int red; 
int green; 
int blue; 
} PALETTE; 


typedef struct 
{ 

int x, 
} POINT; 


Yi 


typedef struct 
{ 

int left, top, 
} SCISSOR; 


right, bottom; 


typedef struct 
{ 
int 
int 


int 


xres; 
yres; 
bpp; 
int depth; 
int pitch; 
int vga_aperture_status; 
int linear_aperture_status; 
int linear_aperture_size; 
unsigned long aperture_address; 
unsigned long vga_memreg_offset; 


/* 
/* 


unsigned long linear_memreg_offset; 


} MODECFG; 
typedef struct /* 
{ 

int 
char 
char 
int 
char 
char 


size; 

revision; 
mode_tables; 
mode_table_offset; 
mode_table_size; 
vga_type; 


unsigned 
unsigned 
unsigned 
unsigned 
unsigned 


unsigned /* 


4, 8, 16, 24, 32 */ 
555 for 15 bpp, 565 for 16 bpp */ 


/* VGA_APERTURE_ENABLE, VGA_APERTURE_DISABLE */ 
/* APERTURE_4M_ ENABLE, APERTURE_8M_ENABLE */ 
/* APERTURE_4M_ENABLE, APERTURE_8M_ENABLE */ 


/* linear aperture address */ 


See MACH 64 BIOS extension info */ 


See VGA types */ 
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unsigned int asic_id; 
unsigned char vga_boundary; 
unsigned char memory_size; 


unsigned char dac_type; /* See DAC types */ 
unsigned char memory_type; 
unsigned char bus_type; /* See BUS types */ 


unsigned char monitor_cntl; 
unsigned int aperture_addr; 
unsigned char aperture_cfg; 
unsigned char color_depth_support; 
unsigned char ramdac_feature_support; 
unsigned char reservedl1; 
unsigned int reserved2; 
unsigned int io_base; 
unsigned char reserved3 [6]; 

} QUERY_STRUCTURE; 


/* Resume word alignment */ 
#pragma pack() 


/* Function declarations */ 


// Routines from ATTR.C 

unsigned long get_color_code(int generic_color); 

void set_fg_color(unsigned long color); 

unsigned long get_fg_color(void) ; 

void set_bg color(unsigned long color); 

unsigned long get_bg_color(void) ; 

void set_fg_mix(int mix); 

int get_fg_mix(void); 

void set_bg mix(int mix); 

int get_bg_mix(void); 

unsigned long get_primary_color(PRIMARYCOLOR primarycolor, 
unsigned long color); 


// Routines from DRAW.C 

void blit(int x1, int yl, int x2, int y2, int width, int height); 
-void clear_screen(int x, int y, int width, int height); 

void draw_rectangle(int x, int y, int width, int height); 

void draw_line(int x1, int yl, int x2, int y2); 

void draw_line24(int x1, int yl, int x2, int y2); 


// Routines from HWCURSOR.C 

void set_hwcursor(int y, int width, int height, 
unsigned long color0, unsigned long colorl, 
unsigned int *bitmap) ; 

void enable_hwcursor (void) ; 

void disable_hweursor (void) ; 

void set_hwcursor_pos(int x, int y); 

void get_hwcursor_pos(POINT *position) ; 
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// Routines from INIT.c 


int 

void 
void 
void 
void 
void 
void 
void 
void 
int 

void 
void 
void 
void 
vold 
void 
void 
int 

void 


detect_mach64 (void) ; 
init_4bpp (void) ; 
init_S8bpp (void) ; 
init_15bpp (void) ; 
init_1é6ébpp (void); 
init_24bpp (void) ; 
init_32bpp (void) ; 
reset_engine(void) ; 
init_engine (void) ; 


open_mode(int mode_code, int pitch_code, int color_code); 


close_mode (void); 

enable _vga_aperture (void) ; 

disable _vga_aperture (void) ; 
enable_linear_aperture(int aperture_size); 
Gisable_linear_aperture (void) ; 
init_aperture(vold) ; 
update_aperture_status (void); 
query_hardware (void) ; 

set_packed_pixel (void); 


// Routines from MEMREG.C 
vga_regw(unsigned int regindex, unsigned long regdata); 
unsigned long vga_regr(unsigned int regindex); 
void app_regw(unsigned int regindex, unsigned long regdata); 
unsigned long app_regr(unsigned int regindex) ; 


void 


void regw(unsigned int regindex, unsigned long regdata); 


unsigned long regr(unsigned int regindex) ; 


void 


iow(unsigned int loaddr, unsigned long iodata); 


unsigned long ior(unsigned int loaddr); 


// Routines from PALETTE.C 


void 


set_palette(int index, PALETTE entry); 


PALETTE get_palette(int index); 


void 


init_palette(voilid) ; 


// Routines from WAIT.C 
void wait_for_idle(void) ; 


void 
void 


wait_for fifo(int entries) ; 
terminate(int idle problem); 


// Routines from MOVEMEM.ASM 
int movemem(void far *ptr, unsigned long phyaddr, unsigned int nwords, int 
direction) ; 


// Routines from ROMCALLS.ASM 
int is_ati_rom(void) ; 

int load_mode_parms(int resolution_code, int pitch_code, int color _depth_code); 
int set_display_mode(int display_mode) ; 
int load_and_set_mode(int resolution_code, int pitch_code, int 
color_depth_code) ; 

int get_query_size(int info_type); 
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int fill query structure(int info_type, unsigned char far *ptr); 
/* External variable and array declarations */ 


extern MODECFG modeinfo; 

extern HWCURSOR cursordata; 
extern QUERY_STRUCTURE querydata; 
extern int mem_cntl; 
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SS ESS SSS SSS PSS SS SSeS SSS SSS SSS SSS SSS SS SESS SSS SSS SSS SSS SS SSS sss sess / 


#include 
#include 
#include 
#include 
#include 


#include 
#include 


<stdio.h> 
<stdlib.h> 
<string.h> 
<conio.h> 
<dos.h> 


“atim6é4.h” 
“sample.h” 


[Rm rence reree- 


DRAW_RECTANGLE - draw a filled rectangle 


A filled rectangle is drawn at (x, y) of size (width x height) using the 
current engine settings. For 24 bpp modes, the engine is actually in 8 
bpp mode and the CRTC is in 24 bpp. For this reason, all horizontal 
parameters must be multiplied by 3. Also, the 24 bpp alignment must 

be determine for the engine to draw pixels in the correct color. Note that 
for 24 bpp modes, the input parameters for this routine are in 24 bpp. 

The rectangle source is determined by the current setting of the DP_SRC 
register. 


void draw_rectangle(int x, int y, int width, int height) 
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unsigned long temp, rotation; 
wait _for_idle(); // wait for idle before reading GUI registers 


// save used registers 
temp = regr(DST_CNTL); 


if (modeinfo.bpp == 24) 

{ 
// adjust horizontal parameters 
x= x * 3; 
width = width * 3; 


// set 24 bpp alignment while maintaining direction bits 
rotation = GET24BPPROTATION (x) ; 
regw(DST_CNTL, (temp & Oxdf) | 

DST_24 ROTATION_ENABLE | 

(rotation << 8)); 
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} 


// perform rectangle fill 

regw(DST_X, (unsigned long)x); 
regw(DST_Y, (unsigned long)y); 
regw(DST_HEIGHT, (unsigned long)height) ; 
regw(DST_WIDTH, (unsigned long) width) ; 


// restore 
regw(DST_CNTL, temp) ; 
/* mem ee te em Oe ie cme ne com Oe om ems OS GD a DO om moe ee cos Oe es cee ee sss in es ms 


CLEAR_SCREEN - clear a region of video memory 


A BLACK filled rectangle is drawn at (x, y) of size (width x height). 
The current engine mix setting will not affect the operation. 


ee ww one oe oe ee ee we Oe GS ten om Om Om 0 om Om Om oe Om Om OS oe me Oe 9 Oe oe Om Om ee De ee Oe em oe oe ee eee ee es ee * / 
void clear_screen(int x, int y, int width, int height) 
{ 
unsigned long temp; 
wait_for_idle(); // wait for idle before reading GUI registers 
// save used registers 
temp = regr(DP_MIX); 
// perform clear screen 
regw(DP_MIX, FRGD_MIX_ ZERO [| BKGD_MIX ZERO); 
draw_rectangle(x, y, width, height); 
// wait for clear screen operation to finish before exiting 
wait_for_idle(); 
// restore DP_MIX register 
regw(DP_MIX, temp); 
} 
Dip pee eset aah ee eens een ar ieee otal aete eh senile eee to 
DRAW_LINE - draw a line from (xl, yl) to (x2, y2) 
The drawing of the last pixel in the line is determined by the current 
setting of the DST_CNTL register (LAST_PEL bit). 
ae ee ee ee ee ee ee ee * / 
void draw_line(int x1, int yl, int x2, int y2) 
{ 
int dx, dy; 
int small, large; 
int x_dir, y_dir, y_major; 
unsigned long err, inc, dec, temp; 
/* call specific routine if mode is in 24 bpp */ 
© 1993 ATI Technologies Inc. PRG888GXO-01 


Proprietary and Confidential F-39 


DRAW.C 


if (modeinfo.bpp == 24) 

{ 
draw_line24(x1, yl, x2, y2); 
return; 

} 


/* determine x & y deltas and x & y direction bits */ 
1f (xl < x2) 
{ 
ax = x2 - x1; 
x_dir = 1; 
} 
else 


1f (yl < y2) 
; 
dy = y2 - yl; 


y_dir = 2; 

} 

else 

{ 
dy = yl - y2; 
y_dir = 0; 

} 


/* determine x & y min and max values; also determine y major bit */ 
1f (dx < dy) 


{ 
small = dx; 
large = dy; 
y_major = 4; 
} 
else 
{ 
small = dy; 
large = dx; 
y_major = 0; 
} 


/* calculate bressenham parameters and draw line */ 
err = (unsigned long) ((2 * small) - large); 

inc = (unsigned long) (2 * small); 

dec Ox3ffff - ((unsigned long) (2 * (large - small))); 


wait_for_idle(); // wait for idle before reading GUI registers 


// save used registers 
temp = regr(DST_CNTL) ; 
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// @araw bressenham line 
regw(DST_X, (unsigned long)x1); 
regw(DST_Y, (unsigned long)y1); 


DRAW.C 


// allow setting of last pel bit and polygon outline bit for line drawing 


regw(DST_CNTL, (temp & 0x60) ! (unsigned long) (y_major | y_dir | x_dir)); 
regw(DST_BRES_ERR, err); 
regw(DST_BRES_INC, inc); 
regw(DST_BRES DEC, dec); 
regw(DST_BRES_LNTH, (unsigned long) (large + 1)); 
// restore 
regw(DST_CNTL, temp); 
} 
/* eh is: iwi vs: ih: sem. am’ Sig, ite‘; Gah Sui i fet ‘ings ek, nel Sl. st: i eae: “Al: ease i’ i ms ‘ili’ id: iy: i 
DRAW_LINE24 - draw a bressenham line from (x1, yl) to (x2, y2). 
Since the engine does not directly support 24 bpp modes, it is set 
to 8 bpp while the CRTC is set to 24 bpp display mode (RGB). For 
rectangle drawing, all X coordinates and widths must be converted 
to 8 bpp sizes. This is done by taking the 24 bpp value and 
multipling it by 3. 
ee cee cers mee ea ue mom om ome mn DD OD OD OE es wae em Oe OS me Oe ome om fe om ee ca me em ee oe eee ee ee em me me ot eee ee em ee oe ee ee es */ 


void draw_line24(int x1, int yl, 
{ 


int x2, int y2) 


dy; 
iner2, iner3; 
templ1, temp2; 


int x, y, xend, yend, dx, 
int dad, -incri; 
unsigned long rotation, 


// save register 
wait_for_idle(); 

templ = regr(DST_CNTL) ; 
temp2 = 0xa3; 


abs(x2 - x1); 
= abs(y2 - yl); 


// 
if 


check slope 
(dy <= dx) // slope <= 1 


1f (xl > x2) 


{ 
x = ; 
Y = y2; 
xend = x1; 


© 1993 ATI Technologies Inc. 
Proprietary and Confidential 


PRG888GXO0-01 
F-4] 


DRAW.C 


- y2; 


} 


d= (2 * dy) - ax; 
incrl = 2 * dy; 

incr2 = 2 * (dy - dx); 
incr3 = 2 * (dy + dx); 


regw(DST_HEIGHT, 
regw(DST_Y, y); 


L)3 


do 

{ 
wait_for_fifo(4); 
rotation = GET24BPPROTATION (x) ; 
regw(DST_CNTL, temp2 | (rotation << 
regw(DST_X, x * 3); 
regw(DST_WIDTH, 3); 


X++; 


if (da >= 0) 
{ 
if (dy <= 0) 
{ 
ad = da + incri; 
} 
else 
{ 
Y+t+; 
regw(DST_Y, y); 
ad = da + incr2; 


} 
else 
{ 
if (dy >= 0) 
{ 
d=d + incri; 
} 
else 
{ 
B dota | 
regw(DST_Y, y); 
a@=d4 + incr3; 
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} 
} 

} while (x <= xend); 
} 
else // slope > 1 
{ 

1f (yl > y2) 

{ 


yY = y2; 
x = X2; 
yend = yl; 


else 

{ 
y = yl; 
x = Xi; 
yend = y2; 


ax = x2 - x1; 
} 


a= (2 * ax) - dy; 
incril = 2 * dx; 

incr2 2 * (dx - dy); 
incr3 = 2 * (dx + dy); 


regw(DST_HEIGHT, 1); 


do 

{ 
wait _for_fifo(3); 
rotation = GET2Z4BPPROTATION (x) ; 
regw(DST_CNTL, temp2 | (rotation << 8)); 
regw(DST_Y_ X, ((unsigned long) (x * 3) << 16) | y); 
regw(DST WIDTH, 3); 


Y++; 


1f (da >= 0) 
{ 
if (dx <= 0) 
{ 
ad =d + incri; 
} 
else 
{ 
X++; 
a@=d + incr2; 


else 
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1f£ (ax >= 0) 


{ 

d= d+ incri; 
} 
else 
{ 

x--7 

ad=d + incr3; 
} 


} 
} while (y <= yend); 
} 


// restore register 
wait_for_fifo(1); 
regw(DST_CNTL, temp1); 


BLIT - perform a screen to screen blit 


Copy the contents of screen memory at (x1, y1) of size (width x height) to 
(x2, y2) using the current engine settings. This is known as an unbounded 

Y source trajectory blit. For 24 bpp modes, the engine is in 8 bpp and 

the CRTC is in 24 bpp. For this reason, all horizontal parameters must be 

mulitplied by 3. The blit source is determined by the current setting 

of the DP_SRC register. 


void blit(int x1, int y1, int x2, int y2, int width, int height) 
{ 
unsigned long temp, rotation; 


wait_for_idle(); // wait for idle before reading GUI registers 


// save used registers 
temp = regr(DST_CNTL) ; 


if (modeinfo.bpp == 24) 
{ 
// adjust horizontal parameters 
x1 = x1 * 3; 
X2 = X2 * 3; 
width = width * 3; 


// get 24 bpp alignment rotation for destination 
rotation = GET2Z24BPPROTATION (x2); 


// set 24 bpp alignment while maintaining direction bits 
regw(DST_CNTL, (temp & Oxdf) | 

DST_24 ROTATION ENABLE | 

(rotation << 8)); 
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// perform a blit 
regw(SRC_X, x1); 
regw(SRC_Y, y1); 
regw(SRC_HEIGHT1, height); 
regw(SRC_WIDTH1, width) ; 


regw(DST_X, x2); 
regw(DST_Y, y2); 
regw(DST_HEIGHT, height) ; 
regw(DST_WIDTH, width) ; 


// restore 
regw(DST_CNTL, temp); 
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INIT.C 


INIT.C 
MACH 64 functions to initialize and set a standard engine context. 


Copyright (c) 1993-94 ATI Technologies Inc. All rights reserved 


#include <stdio.h> 
#include <stdlib.h> 
#include <string.h> 
#include <conio.h> 
#include <dos.h> 


#include “atim64.h” 
#include “sample.h” 


Getect_mach64 - determine if a mach64 based video adapter is installed. 


This routine identifies if a mach6é4 based video adapter is installed. This 
is done by writing and reading the SCRATCH_REGO or SCRATCH_REG1 io based 
registers. These registers must be saved and restored since the mach64 
BIOS uses their contents. 


Returns YES MACH64 if detected, NO _MACH64 if not. 


ea eee eee ee eee ee et eee Oe on me Oe OD OD Oe me oe Oe ee ee ee oe es ee ee em oe en ms ee ew we me om oe oe om ms */ 
int detect_mach64 (void) 
{ 

unsigned long save_value; 

int result; 

// assume failure 

result = NO_MACH64; 

// check for ATI rom signature 

if (is_ati_rom( ) == 0) 

{ 

return (NO_MACH64) ; 

} 

// save old value 

save_value = lor(LOSCRATCH_REGO) ; 

// test odd bits for readability 

Low(LoSCRATCH_ REGO, 0x55555555); 

1£ (iLor(ioSCRATCH_REGO) == 0x55555555) 

{ 

// test even bits for readability 
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Llow(LOSCRATCH REGO, Oxaaaaaaaa) ; 
1f (Llor(LoSCRATCH REGO) == Oxaaaaaaaa) 


{ 
result = YES _MACH64; 


// restore old value 
Low(LoSCRATCH REGO, save_value) ; 


return (result); 


INIT_4BPP - set the MACH64 engine to a 4bpp standard context. 


This routine is used in conjuction with INIT_ENGINE(). 


re ee ee eee ee at */ 
void init_4bpp (void) 
{ 
wait_for_fifo(2); 
regw(DP_PIX_WIDTH, HOST_4BPP | SRC_4BPP | DST_4BPP | 
BYTE _ORDER_MSB TO _ LSB); 
regw(DP_CHAIN MASK, 0x8888); : 
} 
/ es ees ee sew ie, tai Sie tn me aa ec a soe “Gah ak, ah, te: te hin Ga Smee: i sme Se Sis ent ls let, d- nim- i 
INIT 8BPP - set the MACH64 engine to a 8bpp standard context. 
This routine is used in conjuction with INIT_ENGINE(). 
we oe ne ee ee ee eee */ 
void init 8bpp(void) 
{ 
wait_for_fifo(2); 
regw(DP_PIX_WIDTH, HOST_8BPP | SRC_8BPP | DST_8BPP | 
BYTE _ORDER_LSB_TO_MSB); 
regw(DP_CHAIN MASK, 0x8086); 
} 
/ Be ee we ww ee me we we we en oe os en ee en es ee me ons ee me we an ae em me OD oe ae es os se ee moms eo oe 
INIT_15BPP - set the MACH64 engine to a 15bpp standard context. 
This routine is used in conjuction with INIT_ENGINE(). 
a een ee ee ee ee eee eee ee eee eee * / 
void init_15bpp (void) 
{ 
wait_for_fifo(2); 
regw(DP_PIX_WIDTH, HOST_15BPP | SRC_15BPP | DST_15BPP | 
BYTE_ORDER_LSB_TO_MSB) ; 
regw(DP_CHAIN MASK, 0x4210); 
} 
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INIT_16BPP - set the MACH64 engine to a 16bpp standard context. 


This routine is used in conjuction with INIT_ENGINE(). 


ra Sas ess <tc Sa ei i'm co Si “ms us sesh n/c un, aon (ah “hes ms Gab us“ Gn, ses es Sth cena mk SS nob i en” “Se ese ns “twas ms ou Ss (GS Gasca" cs ssa esa * / 
void init_16bpp (void) 
{ 
wait_for_fifo(2); 
regw(DP_PIX WIDTH, HOST_16BPP | SRC_16BPP | DST_16BPP | 
BYTE_ORDER_LSB_TO_MSB) ; 
regw(DP_CHAIN_MASK, 0x0410); 
} 
/ Fe se ae we es wm ee we we we we wee tre ce ee ew we we we we we wes os we oe ee we oe me om ees co ae me 0s mem te ees ens oe ee me we cme ee ee wee ome me ee me me me me oe 
INIT _24BPP - set the MACH64 engine to a 24bpp standard context. 
This routine is used in conjuction with INIT_ENGINE(). 
men a we we em we oe et ee Oe Oe We Oe Om Ot Om 0m Oe Bin Oe Om Om Om O_O Om Os Ons om Oe Om oe Om On OS Oe nb Oe Om On tm me OOD Om oe oh Om en me Oe ee Oe Oe ee * / 
void init_24bpp (void) 
{ 
wait_for_fifo(2); 
regw(DP_PIX_ WIDTH, HOST_8BPP | SRC_8BPP | DST_SBPP | 
BYTE _ORDER_LSB_TO_MSB) ; 
regw(DP_CHAIN MASK, 0x8080); 
} 
/ Cnn ene a eee ee 
INIT _32BPP - set the MACH64 engine to a 32bpp standard context. 
This routine is used in conjuction with INIT_ENGINE(). 
ene eee ee ee ee ee ee ee ee ee ee ee ee ene oe ee ee cee eee eee re oe ee eo eee eee eee ee ee ee * / 
void init_32bpp (void) 
{ 
wait_for_fifo(2); 
regw(DP_PIX WIDTH, HOST_32BPP | SRC_32BPP | DST_32BPP | 
BYTE_ORDER_LSB_TO_MSB) ; 
regw(DP_CHAIN_ MASK, 0x8080); 
} 
/* = es mee ow om Om oe Ow Ow OD Om Om OD OF cen On > OD om On oe me ee ee ee ee ee 
RESET ENGINE - reset engine and clear any FIFO errors 
This function resets the GUI engine and clears any FIFO errors. 
ee wee ee ce ee eee ee fee ane came ames ts em me meets ene me mmm me mite Oe Ss re tr Unt OD ems es mn sm Ot SO OE Oe eee es ee ee ee ee ee ce ee ee oe ee */ 
void reset_engine(void) 
{ 
// reset engine 
iow(LoGEN_TEST_CNTL, 0); 
Low(ioGEN_TEST_CNTL, GUI_ENGINE_ENABLE) ; 
// Ensure engine is not locked up by clearing any FIFO errors 
low(LoBUS_CNTL, (ior(LoBUS_CNTL) & OxffOO0ffff) | Ox00ae0000); 
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/* ee a a es a we ee ws we ee ee ee ee ee ee ee ee eee eee eee te seme ee eee eee ee ete oe ee ee oe eee ee ee ee 
INIT_ENGINE - set the MACH64 engine to a standard context. 


This routine configures the MACH64 engine to a known typical context. 
The context consists of: 


~engine reset and enabling 
-fifo control including clearing fifo overflow errors 
-source and destination pitch 
-source and destination offset into video memory 
-color depth 
host data control 
-~pattern data control 
-line and rectangle control 
-color source, mix, and compare control 
-scissor settings 


-write mask 


void init_engine (void) 
unsigned long pitch_value, xres, yres; 


// determine modal information from global mode structure 

xres = (unsigned long) (modeinfo.xres) ; 

yres = (unsigned long) (modeinfo.yres) ; 

pitch value = (unsigned long) (modeinfo.pitch) ; 

if (modeinfo.bpp == 24) 

{ 
// In 24 bpp, the engine is in 8 bpp - this requires that all 
// horizontal coordinates and widths must be adjusted 
pitch_value = pitch_value * 3; 

} 


// Reset engine and clear any errors 
reset_engine(); 


// Ensure that vga page pointers are set to zero - the upper page 
// pointers are set to 1 to handle overflows in the lower page 
Low(loMEM_VGA_WP_SEL, 0x00010000); 
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iow(loMEM_VGA_RP_SEL, 0x00010000); 


// setup standard engine context 
wait _for_fifo(14); 


regw(CONTEXT MASK, OxFFFFFFFF) ; 


regw(DST_OFF_PITCH, (pitch_value / 8) << 22); 

regw(DST_Y_X, 0); 

regw(DST_HEIGHT, 0); 

regw(DST_BRES_ERR, 0); 

regw(DST_BRES_ INC, 0); 

regw(DST_BRES_DEC, 0); 

regw(DST_CNTL, DST_LAST PEL | DST_Y_TOP_TO_BOTTOM | DST_X_LEFT_TO_RIGHT) ; 


regw(SRC_OFF_PITCH, (pitch_value / 8) << 22); 
regw(SRC_Y_X, 0); 

regw(SRC_HEIGHT1_WIDTH1, 0); 

regw(SRC_Y_X START, 0); 


regw(SRC_HEIGHT2_WIDTH2, 0); 
regw(SRC_CNTL, SRC_LINE_X_LEFT_TO_RIGHT) ; 


walt_for_fifo(13); 
regw(HOST_CNTL, 0); 


regw(PAT_REGO, 0); 
regw(PAT REG1, 0); 
regw(PAT CNTL, 0); 


regw(SC_LEFT, 0); 

regw(SC_TOP, 0); 
regw(SC_BOTTOM, yres-1); 
regw(SC_RIGHT, pitch_value-1); 


regw(DP_BKGD_CLR, 0); 
regw(DP_FRGD_CLR, OxFFFFFFFF); 
regw(DP_WRITE_MASK, OxFFFFFFFF) ; 
regw(DP_MIX, FRGD_MIX_S | BKGD_MIX _D); 
regw(DP_SRC, FRGD_SRC_FRGD_CLR) ; 


wait_for fifo(3); 


regw(CLR_CMP_CLR, 0); 
regw(CLR_CMP_MASK, OxXFFFFFFFF) ; 
regw(CLR_CMP_CNTL, 0); 


switch (modeinfo.bpp) 

{ 
case 4 : init _4bpp(); break; 
case 8 : init 8bpp(); break; 
case 16: 


PRG888GX0-01 © 1993 ATI Technologies Inc. 
F-50 Proprietary and Confidential 


INIT.C 


if (modeinfo.depth == 555) 


{ 

init_15bpp(); // 555 color weighting 
} 
else 
{ 

init_16bpp(); // 565 color weighting 
} 
break; 


case 24: init_24bpp(); break; 
case 32: init_32bpp(); break; 
} 


wait_for_idle(); // insure engine is idle before leaving 


OPEN MODE - set an accelerator mode for the Mach 64 


This function sets an accelerator mode, initializes a global mode 
structure, and determines the access method for the GUI memory mapped 
registers of the Mach 64. This function should be called before any other 
engine function since they depend on the global mode information structure 
that is filled by this function. It is assumed that a Mach 64 based video 
adapter has been detected before calling this function (detect_mach64()). 


Inputs : 

Mode code: MODE_640x480, 
MODE_800x600, 
MODE_1024x768, 
MODE_1280x1024 


Pitch code: PITCH_1024, 


PITCH_XRES 
Color code: COLOR_DEPTH_4, // palettized 4 bpp 
COLOR_DEPTH_8, // palettized 8 bpp 


COLOR_DEPTH_15, // direct color 15 bpp - 555 weighting 
COLOR_DEPTH 16, // direct color 16 bpp - 565 weighting 
COLOR_DEPTH_24, // @Airect color 24 bpp RGB weighting 


COLOR_DEPTH_32 // @irect color 32 bpp - RGBa weighting 
Returns: 

NO_ERROR - the accelerator mode has been set successfully 

YES_ERROR - an error has occurred 

NOT_SUPPORTED - the accelerator mode is not supported 
oe he ee eee ee om EE SO GED em ts ea em Dm ee em me me ee Om ee Om ts oe me mm eet ee me ee cee ee Oe ee ee es eee eee a le ee ee * / 
int open_mode(int mode_code, int pitch_code, int color code) 

int retval; 
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// Setup apertures 
init_aperture(); 
update_aperture_status(); 


// Insure that memory is shared between VGA and acelerator for full 
// memory access 

mem_cntl = inpw(ioMEM_CNTL+2) ; 

outpw(lLoMEM_CNTL4+2, 0); 


// attempt to set an accelerator mode 
retval = load_and_set_mode(mode_code, pitch_code, color_code) ; 
if (retval == NO_ERROR) 
{ 
// Fill modal information structure 
modeinfo.depth = 0; 
switch (mode_code) 
{ 
case MODE 640x480: 
modeinfo.xres = 640; 
modeinfo.yres = 480; 
break; 


case MODE 800x600: 
modeinfo.xres = 800; 
modeinfo.yres = 600; 
break; 


case MODE_1024x768: 
modeinfo.xres = 1024; 
modeinfo.yres = 768; 
break; 


case MODE_1280x1024: 
modeinfo.xres = 1280; 
modeinfo.yres = 1024; 
break; 
} 
switch (pitch_code) 
{ 
case PITCH_1024: 
modeinfo.pitch 
break; 


1024; 


case PITCH_XRES: 
modeinfo.pitch 
break; 


modeinfo.xres; 


} 
switch(color_ code) 
{ 
case COLOR_DEPTH_4: 
modeinfo.bpp = 4; 
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break; 


case COLOR_DEPTH_8: 
modeinfo.bpp = 8; 
break; 


case COLOR_DEPTH_15: 
modeinfo.bpp = 16: 
modeinfo.depth = 555; 
break; 


case COLOR_DEPTH_16: 
modeinfo.bpp = 16; 
modeinfo.depth = 565; 
break; 


case COLOR_DEPTH_24: 
modeinfo.bpp = 24; 
break; 


case COLOR_DEPTH_32: 
modeinfo.bpp = 32; 
break; 
} 


// Insure that engine is clear of any idle or fifo errors 


reset_engine(); 


// Setup palette if 4 or 8 bpp mode 


if ((modeinfo.bpp == 4) || (modeinfo.bp 
{ 
init_palette(); 
} 
} 
else 
{ 


INIT.C 


// Insure VGA apeture is disabled for proper operation of VGA 


// controller (in text mode) before leaving 


disable_vga_aperture(); 


return (retval); 


/* ee ee er or wee ww we ee ee ee ee Be ew ee ee oe ee ee ee ee 


CLOSE_MODE - switch back to VGA mode from an accelerator mode 


This function closes a Mach 64 accelerator mode and switches the display 


back to VGA control. 


void close_mode (void) 
{ 
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union REGS regs; 


// Disable accelerator mode and go back to VGA mode 
set_display_mode(VGA_MODE) ; 


// Disable VGA aperture BEFORE reinitializing the VGA controller 
disable_vga_aperture(); 


// Since the accelerator and VGA controllers share the same video memory, 
// it is necessary to reinitialize the VGA text mode before exiting. Text 
// mode 3 is used here. 

regs.xX.ax = 3; 

int86(0x10, &regs, &regs) ; 


// restore MEM CNTL register 
outpw(iloMEM CNTL+2, mem_cntl); 


ENABLE_VGA_APERTURE - enable the vga aperture 


This function enables the VGA aperture and sets the modeinfo 
structure contents accordingly. The sample code functions will use the new 
settings. 


eee eee ee eee we ee Ee ee ee EE EE ee EEE ee ee EEE ee ee ee Ee eee ee ee */ 
void enable_vga_aperture (void) 
{ 
outpw(LoCONFIG_ CNTL, inpw(ioCONFIG_CNTL) | VGA_APERTURE_ENABLE) ; 
modeinfo.vga_aperture_status = VGA_APERTURE_ENABLED; 
} 
[RB eee ne ee wr ee we en ee en en en ee eee 


DISABLE_VGA_APERTURE - disable the vga aperture 


This function disables the VGA aperture and sets the modeinfo 

structure contents accordingly. The sample code functions will use the new 
settings. It is important to note that if both the linear aperture and 

the vga aperture are disabled, the sample code functions will fail to 
operate correctly. It is the application’s responsibility to insure that 
this does not occur. 


void disable_vga_aperture (void) 

{ 
outpw(ioCONFIG_CNTL, inpw(ioCONFIG_CNTL) & (~VGA_APERTURE_ENABLE) ); 
modeinfo.vga_aperture_status = VGA_APERTURE_DISABLED; 


[BR mew re ne ew ee ee i ne ee en ee ee ee eee eee en nenne 
ENABLE_LINEAR_APERTURE - enable the linear aperture 


This function enables the linear aperture for 4M or 8M. The linear aperture 
address is set by the ROM at boot time. The address is assumed to be 
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correct. The sample code functions will use the new settings. 


void enable _ linear _aperture(int aperture_size) 
{ 
1f (aperture_size == APERTURE_4M ENABLE) // 4M size 
{ 
outpw(loCONFIG_CNTL, (inpw(LoCONFIG_CNTL) & Oxfffc) | APERTURE_4M ENABLE) ; 
modeinfo.linear_memreg offset = 0x3ffc00; 
modeinfo.linear_aperture_size = APERTURE_4M ENABLE; 
} 
else // 8M size 
{ 
outpw(LoCONFIG_CNTL, (inpw(LoCONFIG_CNTL) & Oxfffc) | APERTURE_8M_ENABLE) ; 
modeinfo.linear_memreg_ offset = 0x7ffc00; 
modeinfo.linear_aperture_size = APERTURE_8M ENABLE; 


J *. eoweeens ede Seseer nea dee eeee ee ee aeeeeetwedeseessce See doy euse eee eee Sees eeee 
DISABLE_LINEAR_APERTURE - disable the linear aperture 


This function disables the linear aperture. It is important to note that 
1f both the linear aperture and the vga aperture are disabled, the PGL 
functions will fail to operate correctly. It is the application’s 
responsibility to insure that this does not occur. 


ee eee eee oe ee es we ae wm om wes ee es we wee tem cme eee ee te tees st ee ie Oe me ome ee mm ee ee eee ee */ 
void disable linear_aperture (void) 
{ 
outpw(LoCONFIG_CNTL, 
inpw(ioCONFIG_CNTL) & (~(APERTURE_4M ENABLE | APERTURE_8M_ENABLE) ) ); 
} 
/* om em eee ee oo ee eee ee ee eee ee oe on ms me es ee oe ee ee ee ee ee ee te we se ee oe 


INIT_APERTURE - set the apertures according to Mach 64 hardware 


This function selects the aperture according to the installed hardware 
for performance and functionality. This function is called by open_mode() 
each time a new mode is set. 


void init_aperture (void) 

{ 
// If the VGA controller is enabled, use VGA aperture. If it is disabled, 
// use the linear aperture. 
if (querydata.vga_type == VGA_ENABLE) 


{ 
// Enable VGA aperture 
outpw(LoCONFIG_CNTL, inpw(ioCONFIG_CNTL) | VGA_APERTURE_ENABLE) ; 
} 
else 
{ 
// Enable linear aperture 
outpw(ioCONFIG_CNTL, inpw(ioCONFIG_CNTL) | APERTURE_4M ENABLE) ; 
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UPDATE _APERTURE_STATUS - update aperture status for PGL register access 


This function is always called when a new mode is set through open_mode(). 
The aperture status information in the MODECFG structure is updated 
according the current aperture settings of the Mach 64 hardware. This 
function is also useful if the aperture settings change during an active 
session. The aperture status information is used by the sample functions to 
determine the method of access for the Mach 64 memory mapped registers. 


(em a ikki ih rs mle: sgn eh. Sh iti ne Wt fms eS a em Sm, ht i one en sn * / 
void update_aperture_status (void) 
{ 
int configli, config2; 
/* ---- determine status of VGA and linear apertures ---- */ 
configl = inpw(ioCONFIG_CNTL) ; 
config2 = inpw(ioCONFIG_CNTL+2) ; 
// determine addresses and offsets for linear aperture 
modeinfo.aperture_address = (unsigned long) (configl & Ox3ff0); 
modeinfo.aperture_address = modeinfo.aperture_address << 18; 
if ((configl & APERTURE_4M_ENABLE) == APERTURE_4M ENABLE) 
{ 
// 4M size 
modeinfo.linear memreg offset = O0x3ffc00; 
modeinfo.linear_aperture_size = APERTURE_4M ENABLE; 
} 
else 
{ 
// 8M size 
modeinfo.linear_memreg_offset = 0x7ffc00; 
modeinfo.linear_aperture_size = APERTURE_8M_ ENABLE; 
} 
// determine linear aperture status and address 
if ((configl & 3) != 0) 
{ 
modeinfo.linear_aperture._status = LINEAR_APERTURE_ENABLED; 
} 
else 
{ 
modeinfo.linear_aperture_status = LINEAR_APERTURE_ DISABLED; 
} 
// determine addresses and offsets for vga aperture 
modeinfo.vga_memreg_offset = O0xb000fc00; // segment: offset 
// determine vga aperture status 
if ((configl & VGA_APERTURE_ENABLE) == VGA_APERTURE_ENABLE) 
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{ 
modeinfo.vga_aperture_status = VGA_APERTURE_ENABLED; 
} 
else 
{ 
modeinfo.vga_aperture_status = VGA_APERTURE_DISABLED; 
} 
} 
[BR we en nn nn nn nn nn nn nn enn ee eee 
QUERY_HARDWARE - query the Mach 64 hardware and fill a global query 


structure 


This function calls the Mach 64 ROM to fill a global query structure that 
is used by other functions. It should be called before using any other 


function including init_aperture() and open_mode(). The return code should 
be checked before proceeding. 


Returns: 
NO_ERROR - the query structure has been successfully filled 
YES_ERROR ~ an error has occurred 
meee eee ee EE ER eR we Ee EE ER ER EO EE EE EM RE Ee ee eee * / as 
int query_hardware (void) ‘ 
{ 
int structure_size, error; 
unsigned char far *ptr; 
QUERY _STRUCTURE far *query_ptr; 
// assume error 
error = YES_ERROR; 
// call the BIOS query functions to obtain its size and contents 
structure_size = get_query_size(HEADER_ ONLY); 
if (structure_size != 0) 
{ 
// allocate memory buffer for query structure 
ptr = (unsigned char far *) malloc (structure_size); 
if (ptr != NULL) 
{ 
// call the BIOS to fill memory 
if (f111_query_structure(HEADER_ONLY, ptr) == NO_ERROR) 
{ 
// £111 global query structure from memory buffer 
query_ptr = (QUERY_STRUCTURE far *) (ptr); 
memcpy (&querydata, query_ptr, sizeof (QUERY_STRUCTURE) ); 
// set no error condition 
error = NO_ERROR; 
} 
free (ptr); 
} 
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J ee ceuie Ss ceenel Sie eee ree coen eee le cee eset oekt outeee Sessa weceeeaosee Se aceees 
SET PACKED PIXEL - set VGA controller into packed pixel mode 


The VGA controller is normally set in planar mode. Data transfer 
through the VGA aperture (low and high 32K pages) requires that the 
VGA controller be set in a packed pixel mode where the pixel data 
is arranged contigiously. 


com mam toon one umm cum cm cmD coms OD cm mm Om ome Oe Om OD Om mm om Oe Oe OD OD em oe tne Oe Oe Oe ee ee oe oe oe ee mee mm me oe ne om em me wm es os me me ee * / 
void set_packed_pixel (void) 
{ 
union REGS regs; 
// set VGA controller into packed pixel mode by setting superVGA mode 62h 
regs.X.ax = 0x62; 
int86(0x10, &regs, &regs); 
// set VGA controller space to 128k (AOQOOQOOh-BFFFFh) - this is necessary 
// to allow access to GUI registers in VGA space (BFCOOh) since setting 
// mode 62h sets VGA controller space to 64K (AQOQOOQOh-AFFFFh). 
outp(0x3ce, 6); 
outp(Ox3cf, 1); 
} 
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MACH 64 functions to access the memory mapped registers. 


Copyright (c) 1993 ATI Technologies Inc. All rights reserved 


SSPSS SSeS SSeS Ses eS SSS SSS SS SSS SSS SSS SSS SSS SSE SSS SSS S SSS SSS sss ese SSS / 


#include <stdio.h> 

#include <stdlib.h> 
#include <string.h> 
#include <conio.h> 

#include <dos.h> 


#include “atim64.h” 
#include “sample.h” 


/* ete tne See ten cee ate 8S em Sr aes ms ams ce ss mm Gm oom ee ome ‘con gn ee coms se eee me oO ee Oe ee Gee ee ee ee ee ee 
APP_REGW - write to a memory mapped register through the linear aperture. 


See REGW() for method description. 
void app_regw(unsigned int regindex, unsigned long regdata) 
{ 
unsigned long appaddr; 
unsigned long *dataptr; 
// calculate aperture address 
appaddr = modeinfo.aperture_address + modeinfo.linear_memreg_offset + 


regindex; 


// setup dataptr to write from 
dataptr = &regdata; 


movemem(dataptr, appaddr, 2, 1); 
/* (em cee ane ne ee cee cm cm mn cme Om Sem em owe ms f0ne Suh SUR nee OR GS mh ONS tm le caee em em Se or Ne tee me em tem me eS Om nO ae tee te ee ee ee em ee ee 
APP_REGR - read from a memory mapped register through the linear aperture. 


See REGR() for method description. 


ee cs oe ee ee eee oe eae es ee me te Ome ee Oe ee ee ee ee im ee ee me Om om ED GE Om OE Ge Me ee Oe re GE nm OH Me Oe ms ee Oe ee me Oe om Oe we oe oe om Oe ee me ee owe oe ee oe * / 
unsigned long app_regr(unsigned int regindex) 
{ 
unsigned long appaddr, regdata; 
unsigned long *dataptr; 
// calculate aperture address 
appaddr = modeinfo.aperture_address + modeinfo.linear_memreg_offset + 
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regindex; 


// setup dataptr to read to 
dataptr = &regdata; 


movemem(dataptr, appaddr, 2, 0); 


return (regdata); 


/* ee com cam ee am me me ns nn en ne em Oe OO os Oe te Oe OO re oe en Oe oe es Oe oe ee ss 
REGW - write to a memory mapped register through an available aperture. 


This function will provide write access to the memory mapped registers. 

Each register is 32 bits wide. The appropriate method is selected based 

which aperture is enabled. Preference is given to the small VGA aperture 
since this method will work on all bus types. It is assumed that one of 

the two apertures is enabled. 


VGA aperture method: 


This method will provide write access on all bus types. It is assumed 
that the VGA aperture is enabled. The base address of the memory mapped 
registers is BO00:FC0O0Oh. Each register occupies 4 bytes. This is also 
the fastest method for real mode applications. 


Linear aperture method: 


This method will provide write access on all bus types except ISA. ISA 
will work if the linear aperture can be enabled without mapping to 
existing extended memory. It is assumed that the linear memory aperture 
is enabled. The base address of the memory mapped registers is (base 
aperture address + 3FFC00h) for a 4M aperture size and (base aperture 
address + 7FFCOOh) for an 8M aperture size. Each register occupies 4 
bytes. This method will impose a heavy performance hit since the memory 
mapped registers exist in extended memory above real mode space. 


Pe Pm A PC an RPO ge pe em nS aS Pe ory nC ee eng * 
void regw(unsigned int regindex, unsigned long regdata) ‘ 
if (modeinfo.vga_aperture_status == VGA_APERTURE_ENABLED) 
*((unsigned long far *) (VGA_REGISTER_BASE + regindex)) = regdata; 
a if (modeinfo.linear_aperture_status == LINEAR_APERTURE_ENABLED) 
app_regw(regindex, regdata); 
} 
} 
[Re i we a a a a ee ee ee ee ee 
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REGR - read from a memory mapped register through an available aperture. 


This function will provide read access to the memory mapped registers. 
Each register is 32 bits wide. The appropriate method is selected based 
which aperture is enabled. Preference is given to the small VGA aperture 
since this method will work on all bus types. It is assumed that one of 
the two apertures is enabled. 


VGA aperture method: 


This method will provide read access on all bus types. It is assumed 
that the VGA aperture is enabled. The base address of the memory mapped 
registers is B000:FCOOh. Each register occupies 4 bytes. This is also 
the fastest method for real mode applications. 


Linear aperture method: 


This method will provide read access on all bus types except ISA. ISA 
will work if the linear aperture can be enabled without mapping to 
existing extended memory. It is assumed that the linear memory aperture 
is enabled. The base address of the memory mapped registers is (base 
aperture address + 3FFC0O0h) for a 4M aperture size and (base aperture 
address + 7FFCO0Oh) for an 8M aperture size. Each register occupies 4 
bytes. 


unsigned long regr(unsigned int regindex) 

if (modeinfo.vga_aperture_status == VGA_APERTURE_ENABLED) 
return (*((unsigned long far *) (VGA_REGISTER_BASE + regindex) )); 
ae if (modeinfo.linear_aperture_status == LINEAR_APERTURE_ENABLED) 
return(app_regr(regindex) ); 


IOW - write to an io mapped register in 32 bit wide data. 


This function will write to an io register in 32 bits. This is done in 
two 16 bit writes - first to the low word io address, then to the upper 
word address. 


{ 
outpw(ioaddr, (unsigned int) (lodata & Oxffff)); 
outpw(iloaddr+2, (unsigned int) ((lodata >> 16) & Oxffff)); 
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IOR - read from an io mapped register in 32 bit wide data. 


This function will read an io register in 32 bits. This is done in 
two 16 bit read - first from the low word io address, then from the upper 
word address. 


Use Sanukouetesaee Mime ewweclasceet eo SoReuemee ieee a teeee woe ee Se cue wees */ 
unsigned long ior(unsigned int iloaddr) 
{ 
unsigned long regdata; 
regdata = (unsigned long) (inpw(ioaddr+2) ); 
regdata = (unsigned long) ((regdata << 16) | (inpw(ioaddr))); 
return (regdata); 
} 
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WAIT.C 
MACH 64 functions to wait for engine idle 


Copyright (c) 1993 ATI Technologies Inc. All rights reserved 


SSSSSS eee eS SSS SSS SSH SSS SSS SSE SS SSS SSS SSE SSS SS SES SSS SSS SSS SSS SSeS sess / 


#include <stdio.h> 
#include <stdlib.h> 
#include <string.h> 
#include <conio.h> 
#include <dos.h> 


#include “atim64.h” 
#include “sample.h” 


unsigned long fifo_stat; 
unsigned long gui_stat; 


[PP peert ete tee ee ea tee ee te ee eee eee eee See eee ee ee eee 
WAIT_FOR_IDLE - wait until engine active bit is idle. 


This function uses the DOS tick counter to serve as a timeout clock. If 
the engine is in a lockup condition, the busy bit may stay set. In this 
case, a timeout will occur, an error message will occur, and the program 
will terminate. 


void wait_for_idle(void) 


{ 
unsigned int starttick, endtick; 


// insure fifo is empty before waiting for engine idle 
wait_for_fifo(16); 


starttick = *((unsigned int far *) (DOS_TICK_ADDRESS) ); 
endtick = starttick; 
while ((regr(GUI_STAT) & ENGINE_BUSY) != ENGINE_IDLE) 
{ . 
endtick = *((unsigned int far *) (DOS_TICK_ADDRESS) ); 
if (abs(endtick - starttick) > IDLE_TIMEOUT) 
{ 
fifo_stat = regr(FIFO_STAT) ; 
gui_stat = regr(GUI_STAT); 
terminate (BAD_IDLE) ; 
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WAIT _FOR_FIFO - wait n empty FIFO entries. 


The FIFO contains upto 16 empty entries. The ‘entries’ value must be 
1 to 16. This function implements the same timeout mechanism as the 
Waitforidle() function. 


void wait_for fifo(int entries) 


{ 
unsigned int starttick, endtick; 


starttick = *((unsigned int far *) (DOS_TICK_ADDRESS) ); 
endtick = starttick; 
while ((regr(FIFO_STAT) & Oxffff) > ((unsigned int) (0x8000 >> entries) ) ) 
{ 
endtick = *((unsigned int far *) (DOS_TICK_ADDRESS) ); 
1f (abs(endtick - starttick) > FIFO_TIMEOUT) 


{ 
fifo_stat = regr(FIFO_STAT); 
gui_stat = regr(GUI_STAT); 
terminate (BAD FIFO) ; 
} 
} 
} 
[® exeseiensoe esse eke seco esl eee Sse tees ee eee Stee eee ee ete ees 


TERMINATE - terminate the program due to a timeout from Waitforidle() or 
Waltforfifo(). 


void terminate(int idle_ problem) 

{ 
// @isable accelerator and switch back to VGA mode 
close_mode(); 


// print error message 
if (idle_problem == BAD _IDLE) 
{ 
printf (“Error. Timeout exceeded for engine idle.\n”); 
} 
else // BAD_FIFO 


{ 
printf(”“Error. Timeout exceeded for engine fifo.\n”); 


printf(”“Status at error:\n”); 

printf(”“ Idle status : %081x\n”, gui_stat & 1); 
printf(”“ Fifo status : %081x\n”, fifo_stat & Oxffff); 
printf(” Fifo overflow: “%081x\n”, fifo_stat >> 31); 


exit (1); 
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MOVEMEM. ASM 


=e 


Routine to move a block of data from or to extended memory. This routine 
will typically be used to write and read from the MACHG64 aperture memory. 


Compiling: 
masm /Ml /D<memory model> movemem.asm; 
<memory model> = mem_S for SMALL model, 
mem_M for MEDIUM model, 
mem_L for LARGE model 


me “ee Se Ne Se 6e lM lh UMO Se he Oe OO 


Copyright (c) 1993 ATI Technologies Inc. All rights reserved 


=e 


IFDEF mem_S 


PARM equ 4 3; passed parameters start at bp+4 for small model 
ELSE 

PARM equ 6 3; Passed parameters start at bp+6 for other models 
ENDIF 


IFDEF mem_S 
-MODEL SMALL, C 
ELSEIFDEF mem_M 
-MODEL MEDIUM, C 


ELSE 
-MODEL LARGE, C 
ENDIF 
~DATA 
EVEN 
3; gdt structure required for INT 15h, function 87h call 
gat dw 0,0,0,0 
daw 0,0,0,0 
source_len dw Offffh 3; set to Offffh 
source_addr_low dw ? 3; 24 bit address 
source_addr_hi db ? 
source_access db 93h 3; set to 93h 
source_addr_ext dw 0 3; 386/486 address extensions 
target_len dw Offffh 3; set to Offffh 
target_addr_low dw ? 3; 24 bit address 
target_addr_hi db ? 
target_access db 93h 7 set to 93h 
target_addr_ext dw 0 3; 386/486 address extensions 
dw 0,0,0,0 
daw 0,0,0,0 
~ CODE 
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- 286 


° 
4 


Macro for ‘call’ model handling 


Mcall macro routine 
IFDEF mem_S 


call NEAR PTR routine 


ELSE 


call FAR PTR routine 


ENDIF 


Te) ee Ye) ee) OY YO | OT | Tn a a aD 


me 


endm 


MOVEMEM - Move a block of data to or from an extended address (at or above 
the 1Meg memory boundary). Int 15h, function 87h is used for this 
function. The input parameters are fetched from the stack. 


Inputs : FAR PTR: memory buffer address (segment :offset), 


DWORD : extended physical memory address, 
WORD : number of words to transfer (<= 32767 words), 
WORD : transfer direction flag 


0 -> READ (extended memory to memory buffer) 
1 -> WRITE (memory buffer to extended memory) 


Outputs: Return value from Int 15h, function 87h in ax 
0 for success, 
>0 for error code 


C declaration for this function: 


int movemem(void far *ptr, 
unsigned long dest, 
unsigned int nwords, 
int direction); 


The memory buffer address pointer is expected in segment:offset 
‘dest’ parameter is expected in physical address format. 


public movemem 


IFDEF mem_S 


movemem proc near 

ELSE 

movemem proc far 

ENDIF 
3; create frame pointer 
push bp 
mov bp, sp 


3; save registers used 


push bx 
push cx 
push ax 


format. The 
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push dai 
push si 
push es 


3; retrieve direction flag in di 
mov di, WORD PTR [bp+PARM+10] 


3; setup gdt structure 


mov source_len, Offffh 
mov target_len, Offffh 
mov source_access, 93h 
mov target_access, 93h 


7; Calculate physical address from PTR address for GDT in dx:cx 


mov dx, WORD PTR [bp+PARM+2] ; segment of PTR address 
mov cx, ax 
cle 
shr dx, Och 
shl cx, 4 
mov ax, WORD PTR [bp+PARM] 3; load offset of PTR address 
add cx, ax 
jnc mm 
inc ax 
mmi : 
3; put extended physical memory address in bx:ax 
mov ax, WORD PTR [bp+PARM+4] ; load low word of address 
mov bx, WORD PTR [bp+PARM+6] ; load high word of address 
3; £111 in address depending on direction flag 
cmp dai, 0 3; O = read 
je mm2 
3; WRITE: source = mem buffer, target = extended mem address 
mov source_addr_low, cx 
mov source_addr_hi, dl 
xor dl, dal 
mov source_addr_ext, dx 
mov target_addr_low, ax 
Mov target_addr_hi, bl 
xor bl, bl 
mov target_addr_ext, bx 
jmp mm3 
mm2 : 
7; READ: source = extended mem address, target = mem buffer 
mov source_addr_low, ax 
mov source_addr_hi, bl 
xor bl, bl 
mov source_addr_ext, bx 
mov target_addr_low, cx 
mov target_addr_hi, dl 
xor al, dal 
mov target_addr_ext, dx 
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mm3 : 


3; setup registers and call INT 15h, 
cx, WORD PTR [bp+PARM+8] 


mov 
mov 
push 
pop 
cle 


si, offset ds:gdt 
ds 
es 


ah, 87h 
15h 


return value 
al, ah 
ah, ah 


7; restore saved registers 


Pop 
Pop 
pop 
pop 
pop 
pop 


es 


-gi 


ai 
ax 
cx 
bx 


3; remove frame pointer 


mov 
Pop 


ret 


movemem endp 


end 


PRG888GX0-01 
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sp, bp 
bp 


ri 


function 87h, gdt = es:si 
load word transfer count 
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PALETTE.C 


PALETTE .C 
Functions to set, get, and initialize the palette. 


Copyright (c) 1993 ATI Technologies Inc. All rights reserved 


#include <stdlib.h> 
#include <stdio.h> 
#include <string.h> 
#include <conio.h> 
#include <dos.h> 


#include “atim64.h” 
#include “sample.h” 


[BR wee eee re ee ee eee ee eee en eee eee mee ene eee eee ee ede nee nome ere sen 
SET PALETTE - set a palette entry. 


This function sets a specific palette entry. Each component has a range 
of 0 to 255. The index has a range of 0 to 255. 


void set_palette(int index, PALETTE entry) 


{ s 
/* set DAC write index */ 
outp(LoDAC_REGS, index); 


/* set red component */ 
outp (LODAC_REGS+1, entry.red) ; 


/* set green component */ 
outp (1LOoDAC_REGS+1, entry.green) ; 


/* set blue component */ 
outp(1LoDAC_REGS+1, entry.blue); 
/* ee ame toms ee tn ces in cies ee UH COD ie eats OO Ome come Gh SOD SOD OD DO: GD OFS GOD OD me OD Gt mS OOD ODS cree ee me ces Se Om mm Om mm cm mem ms mee ome me ome meee oe oe 


GET_PALETTE - get a palette entry. 


This function gets a specific palette entry. Each component has a range 
of 0 to 255. The index has a range of 0 to 255. 


PALETTE get_palette(int index) 
{ 
PALETTE entry; 


/* set DAC read index */ 
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INIT PALETTE - Set the palette to default values. 


outp(LoDAC_REGS+3, index) ; 


/* get red component */ 
entry.red = inp (ioDAC_REGS+1) ; 


/* get green component */ 
entry.green = inp(1oDAC_REGS+1); 


/* get blue component */ 
entry.blue = inp(ioDAC_REGS+1); 


return (entry); 


This function initializes the palette table by setting the entries toa 


set of default values. 


The first 16 entries are set to EGA/VGA colors. Each 


EGA/VGA color is replicated 16 times to fill the 256 palette entries. 


void init_palette(void) 


{ 


int i, j, index; 
PALETTE entry[16] = 
{ 


{ 0, 0, 0}, // black 
{ 0, O, 168}, // blue 
{ 0, 168, 0}, // green 
{ 0, 168, 168}, // cyan 
{ 168, 0, 0}, // red 
{ 168, 0, 168}, // magenta 
{ 168, 168, 0}, // brown 
{ 168, 168, 168}, // light gray 
{ 84, 84, 84}, // gray 
{ 0, 0, 255}, // light blue 
{ 0, 255, 0}, // light green 
{ 0, 255, 255}, // light cyan 
{ 255, 0, 0}, // light red 
{ 255, 0, 255}, // light magenta 
{ 255, 255, 0}, // yellow 
{ 255, 255, 255} // white 

}; 

// set first 16 entries 

for (index = 0; index < 16; index++) 


{ 


set_palette(index, entry[index]); 


// set other entries by replicating the first 16 entries 


index = 16; 
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for (1 = 1; i < 16; i++) 
{ 
for (j = 0; Jj < 16; j++) 
{ 
set_palette(index, entry[i]); 
index++4+; 
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ROMCALLS.ASM 


ROMCALLS.ASM 
Interface routines to access MACH64 ROM services. 


Compiling: 
masm /Ml /D<memory model> romcalls.asm; 
<memory model> = mem_S. for SMALL model, 
mem_M for MEDIUM model, 
mem_L for LARGE model 


A eh, i i a i | i a} 


Copyright (c) 1993 ATI Technologies Inc. All rights reserved 


=e 


include atim64.inc 


IFDEF mem_S 


PARM equ 4 3} passed parameters start at bp+4 for small model 
ELSE 

PARM equ 6 3 passed parameters start at bp+6 for other models 
ENDIF 


IFDEF mem_S 
-MODEL SMALL, C 
ELSEIFDEF mem_M 
-MODEL MEDIUM, C 


ELSE 
-MODEL LARGE, C 
ENDIF 
- DATA 
ati_sig db ‘761295520’, 0 
rom_addr dw 64h 
dw 0c000h 
db 0 
«CODE 
-286 


3 Macro for ‘call’ model handling 
Mcall macro routine 
IFDEF mem_S 

call NEAR PTR routine 


ELSE 
call FAR PTR routine 
ENDIF 
endm 
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IS_ATI_ROM 


Check if ROM has ATI signature. Normally, the starting location of the 9 
character signature is at ROM_SEGMENT:0031h. In an MCA system, this is not 
the case and can be anywhere in the first 128 bytes of the ROM. For this 
reason, a signature scan method is used. 


Inputs : none 


me “e “oe Se Be “Ne Se Ne te NO NO 


Outputs: Returns 1 in ax if signature was found, 0 if not 


™e 


public is_ati_rom 


IFDEF mem_S 


is_ati_rom proc near 

ELSE 

is_ati_rom proc far 

ENDIF 
3; save registers used 
push bx 
push cx 
push ax 
push dai 
push si 
push es 


3; setup ati signature string scan 
Mcall rom_base get rom segment in ax 


mov ax, 0 3; assume no ATI ROM 

push ax 3; setup ROM search segment es 

pop es 

mov si, 0 3 start search at rom_seg:0000h 

mov di, offset ati_sig ; compare with ati_sig string 

mov bl, byte ptr ds:[di] ; load first character of string 

mov cx, 128 3; search first 128 bytes of ROM 
match_chk: 

mov al, byte ptr es:[si] ; get byte from ROM 

emp al, bl 3; does first character match 

jne next_char 


3; compare with ati signature string 


push si 7 save sl 
mov dx, 1 7; assume ATI ROM 
mov di, offset ati_sig ; compare with ati_sig string 
inc dai 3; start at second character 
inc si 
sig_check: 
mov al, byte ptr es:[si] 
mov ah, byte ptr ds: [di] 
inc si 
inc di 
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cmp ah, 0 3; 0 = end of string 
je end_check 
cmp al, ah 
je sig check 
mov ax, 0 3; not ATI ROM 
end_check: 
pop si 3 restore si 
cmp ax, 0 3 check for string match 
jne sig found 
next_char: 
inc si 


loop match _chk 


sig_found: 
mov ax, ax ; O = not found, 1 = found 


3; restore saved registers 


pop es 
pop si 
pop dai 
pop ax 
pop cx 
pop bx 
ret 


is_ati_rom endp 


ROM_BASE 
Retrieve base segment of MACH64 ROM. 


Inputs : none 


me te te Ne Ne NO te 


Outputs: Returns base rom segment in ax (usually C000h or C800h) 


we 


public rom_base 


IFDEF mem_S 


rom_base proc near 

ELSE 

rom_base proc far 

ENDIF 
3; save registers used 
push cx 
push ax 


; retrieve rom segment address from MACH64 register 
Mov dx, ioSCRATCH REG1 
in al, dx 
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and al, 7Fh 
mov ah, 0 

mov ¢1,.°7 

shl ax, cl 

add ax, 0C000h 


3; restore saved registers 


pop ax 

pop cx 

ret 
rom_base endp 


LOAD_MODE_PARMS 


stack. 


Inputs : WORD resolution code 
CH = 12h - 640x480 
CH = 6Ah - 800x600 
CH = 55h - 1024x768 
CH = 83h - 1280x1024 


WORD pitch code 
CL [bits 7-6] = 0 - 1024 
CL [bits 7-6] = 1 - don’t change 
CL [bits 7-6] = 2 - pitch size = 


WORD deep color code 
CL [bits 3-0] = 1- 4 bpp 
CL [bits 3-0] = 2 - 8 bpp 
CL [bits 3-0] = 3 - 15 bpp (555) 
CL [bits 3-0] = 4 - 16 bpp (565) 
CL [bits 3-0] = 5 - 24 bpp 
CL [bits 3-0] = 6 - 32 bpp 


Outputs: Returns error code in ax 
AX = 0 - no error 


Re 


AX = 2 - function not supported 


me we Ne Ne te Me te fe me ONO Oe Oe te OO Oe Oe ee OO Oe Oe Oe oe Oe Oe Se Ne te ee 


we 


public load_mode_parms 


IFDEF mem_S 
load_mode_parms proc near 
ELSE 

load_mode_parms proc far 
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resolution width 


1 - function complete with error 


ROMCALLS.ASM 


Load accelerator mode parameters to determine the mode to be invoked by a 
following call to SET_DISPLAY_MODE. Input parameters are fetched from the 
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ENDIF 
3 create frame pointer 
push bp 
mov bp, sp 


3; save registers used 
push ex 


3; setup parameters for call to ATI rom 


Mcall rom_base 7 get rom segment in ax 
mov rom_addr+2, ax 

mov ax, WORD PTR [bp+PARM] 3; get resolution code 
mov ch, al 

mov ax, WORD PTR [bp+PARM+2] 3; get pitch code 

shl al, 6 

mov Gl, ai 

mov ax, WORD PTR [bp+PARM+4] 3; get deep color code 
and als. 7 

or cl, al 

mov ax, 0 ; function code 0 

mov rom_addr, 64h 

call DWORD PTR rom_addr 3; ¢all ROM 


3; setup error code in AL 
mov al, ah 
xor ah, ah 


3; restore saved registers 
pop cx 


3; remove frame pointer 


mov sp, bp 
pop bp 
ret 


load_mode_parms endp 


SET_DISPLAY_MODE 


Set display to accelerator (after a call to LOAD _MODE_PARMS()) or VGA mode. 
Input parameters are fetched from the stack. 


Inputs : WORD display mode 
on 0 - VGA 
CL = 1 - Accelerator 


Outputs: Returns error code in ax 
AX = 0 - no error 
AX = 1 - function complete with error 
AX = 2 - function not supported 


Se i i, i i i Sn, i, Sn, Sa!) 
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public set_display_mode 


IFDEF mem_S 
set_display_ mode proc near 


ELSE 

set_display_mode proc far 

ENDIF 
3 ¢reate frame pointer 
push bp 
mov bp, sp 


3; save registers used 
push cx 


7; setup parameters for call to ATI rom 


Mcall rom_base 3; get rom segment in ax 
mov rom_addr+2, ax 

mov ax, WORD PTR [bp+PARM] + get display mode flag 
mov cl, al 

mov ax, -1 ; function code 1 

mov rom_addr, 64h 

call DWORD PTR rom_addr ; ¢all ROM 


3; setup error code in AL 
mov al, ah 
xor ah, ah 


3; restore saved registers 
pop cx 


3; remove frame pointer 


mov sp, bp 
pop bp 
ret 


set_display_mode endp 


LOAD_AND_SET_MODE 


Load accelerator mode parameters and set accelerator mode. Input parameters 
are fetched from the stack. . 


Inputs : WORD resolution code 
CH = 12h - 640x480 
CH = 6Ah - 800x600 
CH 55h - 1024x768 
CH 83h - 1280x1024 


A i, i a i i, | Se | a LD 
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WORD pitch code 
CL [bits 7-6] = 
CL [bits 7-6] 
CL [bits 7-6] = 


- 1024 
- pitch size 
WORD deep color code 


CL [bits 3-0] = 1 - 4 bpp 
CL [bits 3-0] = 2 - 8 bpp 


Outputs: Returns error code in ax 
AX = 0 - no error 


Re 


ue we Me Of Sy ee) ey ee, ee) ee) | | | | | | 


me 


public load_and_set_mode 


IFDEF mem_S 
load_and_set_mode proc near 


ELSE 

load_and_set_mode proc far 

ENDIF 
3 create frame pointer 
push bp 
mov bp, sp 


3; gave registers used 
push cx 


3; setup parameters for call to ATI rom 


Mcall rom_base 

mov rom_addr+2, ax 

mov ax, WORD PTR [bp+PARM] 
mov ch, al 

mov ax, WORD PTR [bp+PARM+2] 
shl al, 6 

mov cl, al 

mov ax, WORD PTR [bp+PARM+4] 
and al, 7 

or cl, al 

mov ax, 2 

mov rom_addr, 64h 

call DWORD PTR rom_addr 


7; setup error code in AL 
mov al, ah 
xor ah, ah 
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0 
= 1 - don’t change 
2 


CL [bits 3-0] 3 - 15 bpp (555) 
CL [bits 3-0] 4 - 16 bpp (565) 
CL [bits 3-0] = 5 - 24 bpp 
CL [bits 3-0] = 6 - 32 bpp 


= 1 - function complete with error 
AX = 2 - function not supported 


e 
rf 


me 


=e 


=e 


=e 


resolution width 


get rom segment in ax 
get resolution code 


get pitch code 


get deep color code 


function code 2 


call ROM 
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3; restore saved registers 
pop cx 


3 remove frame pointer 


mov sp, bp 
pop bp 
ret 


load_and_set_mode endp 


Ta, Mi Mn a, Sn, |, | |, i | | | | | | 


=e 


GET_QUERY_SIZE 


Retrieve the query size in bytes to reserve for a subsequent call to 
FILL_QUERY_STRUCTURE. The size of the header or header & mode tables can be 
retrieved. Input parameters are fetched from the stack. 


Inputs : WORD information type 
CL = 0 - header information only 
CL = 1 - header and mode table information 


Outputs: Returns size in bytes in ax 
AX = size in bytes 


public set_display_mode 


IFDEF mem_S 
get_query size proc near 


ELSE 
get_query_ size proc far 
ENDIF 
3; create frame pointer 
push bp 
mov bp, sp 
3; save registers used 
push cx 
7; setup parameters for call to ATI rom 
Mcall rom_base 3; get rom segment in ax 
mov rom_addr+2, ax 
Mov ax, WORD PTR [bp+PARM] 7; get info type flag 
mov cl, al 
mov ax, 8 ; function code 8 
mov rom_addr, 64h 
call DWORD PTR rom_addr 3; ¢all ROM 
3; check for errors in AH 
emp ah, 0 
© 1993 ATI Technologies Inc. PRG888GXO0-01 


Proprietary and Confidential F-79 


ROMCALLS.ASM 


je no_error 


3; error: set ax to zero 
Mov ax, 90 
jmp qsize_exit 


no_error: 
7; no error: move size from cx to ax 
mov ax, CX 


qsize_exit: 
3; restore saved registers 


pop cx 


3; remove frame pointer 


mov sp, bp 
pop bp 
ret 


get_query_ size endp 


FILL_QUERY_STRUCTURE 


Fill a query structure containing information about the installed hardware. 
This function should be called after called GET QUERY _SIZE and with the 
same information type flag. This insures that the structure is large 
enough to hold the fill information. Input parameters are fetched from the 
stack. 


me me ue me me Me ™e 


Inputs : WORD information type 
CL = 0 - header information only 
CL = 1 - header and mode table information 


FAR POINTER address of structure to be filled (DX:BX) 


Outputs: Returns error code in ax 
AX = 0 - no error 
AX = 1 - function complete with error 
AX = 2 - function not supported 


ee ee ee ee ee ee ee ee eee ee eee eee ee ee es ee ee) 


public set_display_mode 


=e =e ue =e =e =e =e ™e me ue me me 


™e 


IFDEF mem_S 
fill_query_structure proc near 
ELSE 
fill_query_structure proc far 
ENDIF 
3; create frame pointer 
push bp 
mov bp, sp 
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3; save registers used 


push bx 

push cx 

push ax 

3; setup parameters for call to ATI rom 

Mcall rom_base 7 get rom segment in ax 
mov rom_addr+2, ax 

mov cx, WORD PTR [bp+PARM] 7; get info type flag 
mov bx, WORD PTR [bp+PARM+2] 3; get offset of address 
mov ax, WORD PTR [bp+PARM+4] 3 get segment of address 
mov ax, 9 3; function code 9 

mov rom_addr, 64h 

call DWORD PTR rom_addr 3 ¢all ROM 


3; setup: error code in AL 
mov al, ah 
xor ah, ah 


3; restore saved registers 


pop ax 
pop cx 
pop bx 


3; remove frame pointer 


mov sp, bp 
pop bp 
ret 


fill_query_structure endp 


end 
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VINT .ASM 
Interface routines to handle MACH64 CRITIC vertical interrupts. 


Compiling: 
masm /Ml /D<memory model> romcalls.asm; 
<memory model> = mem_S for SMALL model, 
mem_M for MEDIUM model, 
mem_L for LARGE model 


me me me Ne =e me me SY ee) ee me 


Copyright (c) 1993 ATI Technologies Inc. All rights reserved 


include ..\util\atim6é4d.ine 


IFDEF mem_S 


PARM equ 4 3; passed parameters start at bp+4 for small model 
ELSE 

PARM equ 6 3; passed parameters start at bp+6 for other models 
ENDIF 


IFDEF mem_S 
-MODEL SMALL, C 
ELSEIFDEF mem_M 
-MODEL MEDIUM, C 
ELSE 

-MODEL LARGE, C 
ENDIF 


- DATA 


- CODE 
- 286 


3; Macro for ‘call’ model handling 
Mcall macro routine 
IFDEF mem_S 

call NEAR PTR routine 
ELSE 

call FAR PTR routine 
ENDIF 

endm 


ENABLE_VBLANKINT 


Enable the vertical blank interrupt bit. 


me me Me 6 fF 
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Inputs : none 


Outputs: none 


VINT.ASM 


public enable _vblankint 


IFDEF mem_S 

enable _vblankint proc 
ELSE 

enable _vblankint proc 
ENDIF 


near 


far 


3 save registers used 


push dx 


3; set vertical blank interrupt bit 


mov ax, iLoCRTC_INT_CNTL 
in al, dx 

or al, 2 

out dx, al 


3; restore saved registers 


pop dx 


ret 


enable _vblankint endp 


A a i | a i a] 


=e 


DISABLE_VBLANKINT 


Inputs : none 


Outputs: none 


Enable the vertical blank interrupt bit. 


public disable_vblankint 


IFDEF mem_S 
disable_vblankint proc 
ELSE 

disable_vblankint proc 
ENDIF 


near 


far 


3; save registers used 


push ax 


3; clear vertical blank interrupt bit 


mov ax, 
in al, 
and al, 
out ax, 
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LoCRTC_INT_CNTL 
ax 

Ofbh 

al 


om om ewe em om eee ame em am we ee ee ow om om = am =m 6S ee Gm om om oe oe oe Oe oe oe om oe GS oe os ae oe OD ow ee ae a om Oe Se ee ee Oe 8m ee ee ee Oe om aw ae oe ee om oe om me ee oe oe ae oe ae om oe = 
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3; restore saved registers 
pop dx 


ret 


disable _vblankint endp 


WAIT_FOR_VBLANK 


Wait until vertical blank interrupt bit is set. It is assumed that the 
vertical blank interrupt enable bit is set. 


Inputs : none 


=e me ume 06% ume 0% me 2 6™e 06S 


Outputs: none 


public wait_for_vblank 


me 


IFDEF mem_S 


wait_for_vblank proc near 
ELSE 

wait _for_vblank proc far 
ENDIF 


3; save registers used 
push ax 


3; ack vertical blank interrupt to clear it 


mov ax, iLoCRTC_INT_CNTL 
in al, dx 

or al, 6 

out ax, al 


3; wait until it gets set again 


chkvblank: 
in al, dx 
and al, 4 
cmp al, 4 
jne chkvblank 


3; restore saved registers 
pop ax 


ret 


wait_for_vblank endp 


SET_VLINE 


Inputs : WORD vertical line to set interrupt bit at. 


Ly ee, ee ee ee 
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3; Outputs: none 
public set_vline 


IFDEF mem_S 


set_vline proc near 

ELSE 

set_vline proc far 

ENDIF 
3; ¢Create frame pointer 
push bp 
mov bp, sp 


3; save registers used 
push dx 


3; set vertical line to set interrupt bit at 


mov ax, WORD PTR [bp+PARM] 
mov dx, ioCRTC_VLINE_CRNT_VLINE 
out dx, ax 


3; restore saved registers 
pop dx 


3; remove frame pointer 


mov sp, bp 
pop bp 
ret 
set_vline endp 
ENABLEVLINEINT 


Enable the vertical line interrupt bit. 


Inputs : none 


me “Se Se Ne te Me tO OF 


Outputs: none 


public enable vlineint 


=e 


IFDEF mem_S 


enable _vlineint proc near 
ELSE 

enable vlineint proc far 
ENDIF 


3; save registers used 
push ax 


3; set vertical line interrupt bit 
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Mov ax, LoCRTC_INT_CNTL 
in al, dx 

or al, 8 

out ax, al 


3; restore saved registers 
pop ax 


ret 


enable vlineint endp 


DISABLEVLINEINT 
Enable the vertical blank interrupt bit. 
Inputs : none 


° 
a 
e 
, 
° 
a 
° 
a 
e 
a 
° 
a 
e 
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e 
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Outputs: none 


public disable_vlineint 


° 
s 


IFDEF mem_S 


disable vlineint proc near 
ELSE 

disable vlineint proc far 
ENDIF 


3; save registers used 
push dx 


3; clear vertical line interrupt bit 


mov ax, ioCRTC_INT_CNTL 
in al, dx 

and al, Of7h 

out dx, al 


3; restore saved registers 
pop dx 


ret 


disable vlineint endp 


WAIT_FOR_VLINE 


Wait until vertical line interrupt bit is set. It is assumed that the 
vertical line interrupt enable bit is set. Also, it is assumed that the 
CRTC_VLINE vertical line has been previously set in the 
CRTC_VLINE_CRNT_VLINE register. 


me te OMe OO CMe Oe Oe OO 
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Inputs : none 
Outputs: none 


public wait_for_viine 


IFDEF mem_S 


wait_for_vline proc near 
ELSE 

wait_for_vline proc far 
ENDIF 


3; save registers used 
push ax 


3; ack vertical line interrupt to clear it 


mov dx, LoCRTC_INT_CNTL 
in -al, ax 

or al, 18h 

out dx, al 


3; wait until it gets set again 


chkvline: 
in al, dx 
and al, 10h 
cmp al, 10h 
jne chkvline 


3; restore saved registers 
pop dx 


ret 
wait_for_vline endp 


end 
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C include file for functions in VINT.C 


Copyright (c) 1993 ATI Technologies Inc. All rights reserved 


/* Routines in VINT.c */ 
void enable _vblankint (void) ; 
void disable_vblankint (void) ; 
void wait_for_vblank(void) ; 


void set_vline(int vline); 
vold enable vlineint (void) ; 
void disable vlineint (void); 
void wait_for_vline (void); 
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Routines to bring in a TGA file to video memory for the sample code examples. 


Copyright (c) 1993 ATI Technologies Inc. All rights reserved 


See Se SS See SSSR SS SS SSS SSS SS SSS SSE SS SESE SESS SSS SS SSeS SSS SSS ss sess sess / 


#include <stdio.h> 
#include <stdlib.h> 
#include <string.h> 
#include <conio.h> 


#include “..\util\atim64.h” 
#include “..\util\sample.h” 
#include “vtga.h” 


GET TARGA HEADER - load targa file header information 


Inputs : £ilename, 
pointer to header structure 


Return codes: SUCCESS - success 
OPEN_ERROR - can’t open file 

ee ok Si oc sas aa Se mtg Ta ikl i pe ae i. ask a fet asa et i,m en a a * / 

int get_targa_header(char *filename, TARGA_HEADER *header) 

{ 

FILE *TargaFile; 
// open targa file 
TargaFile = fopen(filename, “rb”); 
1f (TargaFile == NULL) 
{ 
return (OPEN_ERROR) ; 
} 
// check for support ability (type 1, 8 bpp, color map) 
fread(header, sizeof(TARGA_HEADER), 1, TargaFile) ; 
fclose (TargaFile) ; 
return (SUCCESS) ; 

} 

/* (50 cm em cee cam cm cca me me meme meme Mate re mam ene me eh Ome sO me Ont em meats itn om Sm > GO em Sm me Oe mm ems ea Oe ms eee ee to oem ee on es me oe oe ee 
SET _ TARGA PALETTE - load a targa image color table and set the palette 
Inputs : filename of targa image file 
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Return codes: SUCCESS - success 
OPEN_ERROR - can’t open file 
IMAGE TYPE_ERROR - targa file does not have color table 


Notes : - a valid set accelerator mode is assumed 
-~ the image size is intended to be fairly small 
- accelerator mode is assumed to be 8 bpp 


int set_targa_palette(char *filename) 


FILE *TargaFile; 
TARGA _HEADER header; 
PALETTE entry; 

int i; 


// open targa file 
TargaFile = fopen(filename, “rb”); 
if (TargaFile == NULL) 
{ 
return (OPEN_ERROR) ; 
} 


// check for support ability (type 1, 8 bpp, color map) 
fread(&header, sizeof(TARGA_HEADER), 1, TargaFile); 


// support check for type 1, 8 bpp, color map 

if ((header.cm_type != 1) I] 
(header.image_type != 1) |] 
(header.pixel_depth != 8) |] 
(header.image_coord == 1) || 
(header.image_coord == 3)) 


// return error if image type is not supported (no color table) 
fclose(TargaFile) ; 
return (IMAGE TYPE_ERROR) ; 

} 


// read color map and setup palette for 8 bpp 
for (1 = header.cm_origin; i < header.cm_length; i++) 
{ 
entry.blue = fgetc(TargaFile) >> 2; 
entry.green = fgetc(TargaFile) >> 2; 
entry.red = fgetc(TargaFile) >> 2; 
set_palette(i, entry); 
} 


fclose(TargaFile) ; 


// return to caller 
return (SUCCESS) ; 
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LOAD _TARGA - load a targa image into video memory 


Inputs : filename of targa image file, 
(xX, y) starting position (top-left corner), 
paletteflag (set or don’t set for 8 bpp mode) - 
NOSET PALETTE or SET_PALETTE 


Return codes: SUCCESS - success 
OPEN_ERROR - can’t open file 
MEMORY_ERROR - not enough memory for input buffer 
IMAGE TYPE _ERROR - targa file is not supported 


Notes : - a valid set accelerator mode is assumed 
-~ the image size is intended to be fairly small 
- accelerator mode is assumed to be 8 bpp 


load_targa(char *filename, int x, int y) 


FILE *TargaFile; 

TARGA HEADER header; 

unsigned char *inbuffer; 

unsigned long templ, temp2, temp3, temp4, data; 
unsigned long totalbytes, hostwrts; 

int drawx, drawy, j; 

int shifter, remainder; 

int adder, error; 

unsigned char r, g, b; 

unsigned char value; 


// open targa file 
TargaFile = fopen(filename, “rb’”); 
if (TargaFile == NULL) 
{ 
return (OPEN_ERROR) ; 
} 


// check for support ability (type 1, 8 bpp, color map) 
fread(&header, sizeof (TARGA HEADER), 1, TargaFile); 


// support check for type 1, 8 bpp, color map 

error = 0; 

1f ((header.cm_type != 1) II 
(header.image_type != 1) I| 
(header.pixel_depth != 8) || 
(header.image_coord == 1) |] 
(header.image_coord == 3)) 


error++; 
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// support check for type 2, 24 bpp, no color map 
if ((header.cm_type != 0) |] 
(header.image_type != 2) II 
(header.pixel_ depth != 24) || 
(header.image_coord == 1) || 
(header.image_coord 3)) 


error++; 


// return error if image type is not supported 
if (error > 1) 


fclose(TargaFile) ; 
return (IMAGE TYPE ERROR); 


// skip over color table if 8 bpp image type 
if (header.pixel_ depth == 8) 


fseek(TargaFile, (unsigned long) (header.cm_length * 3), SEEK_CUR); 
} 


// open input buffer (image width * image pixel depth in bytes) 
inbuffer = (unsigned char *) malloc ((header.width +1) * (header.pixel_ depth 
/ 8));3 
if (inbuffer == NULL) 
{ 
printf(“Not enough memory to show targa file.\n”); 
fclose(TargaFile) ; 
return (MEMORY_ERROR) ; 
} 


// determine if an additional host write is needed at the end of the loop 
remainder = 0; 
totalbytes = (unsigned long) (header.width) * (unsigned long) (header.height) ; 
hostwrts = totalbytes / 4; 
if ((hostwrts * 4) != totalbytes) 
{ 
remainder++; 
} 


// setup engine for buffer to screen host data transfer 
wait_for_idle(); 

templ = regr(DP_SRC); 

temp2 = regr(DP_MIX); 

temp3 = regr(SRC_CNTL) ; 

temp4 = regr(DST_CNTL) ; 


regw(DP_SRC, FRGD_SRC_HOST) ; 
regw(DP_MIX, FRGD_MIX_S | BKGD_MIX_S); 
regw(SRC_CNTL, 0); 
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if (header.image_coord == 0) 
{ 
// bottom left 
regw(DST_CNTL, DST_Y _BOTTOM_TO TOP | DST_X LEFT TO_RIGHT); 
regw(DST_X, X); 
regw(DST_Y, y + header.height - 1); 
regw(DST_HEIGHT, header.height) ; 
regw(DST_WIDTH, header.width) ; 


else 


// top left 
regw(DST_CNTL, DST_Y TOP_TO_BOTTOM | DST_X_RIGHT_TO_ LEFT); 
regw(DST_X, x + header.width - 1); 
regw(DST_Y, y); 
regw(DST_HEIGHT, header.height) ; 
regw(DST_WIDTH, header.width) ; 
} 


// main draw loop 
if (header.pixel_depth == 24) 
{ r 


adder = 3; 
} 
else 
{ 

adder = 1; 
} 
j = 0; 
shifter = 0; 
data = 0; 


for (drawy = 0; drawy < header.height; drawy++) 
{ 
// get next image scanline 
fread((unsigned char *)inbuffer, 
header.width * (header.pixel_depth / 8), 1, TargaFile); 


// get next line of pixels from input buffer 
for (drawx = 0; drawx < (header.width * (header.pixel depth / 8)); drawx 
= drawx + adder) 
{ 
if (header.pixel_depth == 24) 
{ 
// get 24 bpp data and convert to 8 bpp 


g = (unsigned char) (*(inbuffer + drawx) & Oxff); 
b = (unsigned char) (*(inbuffer + drawx + 1) & Oxff); 
r= (unsigned char) (*(inbuffer + drawx + 2) & Oxff); 


value = (unsigned char)((r & Oxe0) | ((g >> 3) & Ox1lc) | ((b >> 
6) & 0Ox03)); 
} 
else 
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// get 8 bit data 
value = (unsigned char) (*(inbuffer + drawx) & Oxff); 
} 


data = data | ((unsigned long) (value) << shifter); 
shifter = shifter + 8; 


// accumulate 32 bits at a time before writing 
j++; 
if (j > 3) 
. 
wait_for_fifo(1); 
regw(HOST_DATAO, data); 
j = 0; 
shifter 


= O; 
data = O; 


} 


// write remaining data if needed 
if (remainder != 0) 
{ 
wait_for_fifo(1); 
regw(HOST_DATAO, data); 
} 


// restore main engine context 
wait_for_idle(); 

regw(DP_SRC, temp1); 
regw(DP_MIxX, temp2); 
regw(SRC_CNTL, temp3); 
regw(DST_CNTL, temp4) ; 


// close buffers and targa file 
free (inbuffer) ; 
fclose(TargaFile) ; 


// return to caller 
return (SUCCESS); 
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Copyright (c) 1993 ATI Technologies Inc. All rights reserved 


MEE SPS tS SSS PBS SSSR SSS SS SSE SSS SESE SS SESE SESS SSE SESS SESS SSS SEs ss essssssces* / 


/* Return codes for targa routines */ 
#define SUCCESS 
#define OPEN ERROR 
#define MEMORY ERROR 
#define IMAGE TYPE _ERROR 


/* Use BYTE alignment */ 
#pragma pack(1) 


typedef struct 


{ 
char 
char 
char 
int 
int 
char 
int 
int 
int 
int 
char 
char 


id_length; 
cm_type; 
image_type; 
cm_origin; 
cm_length; 
cm_entrysize; 
x_origin; 
y_origin; 
width; 
height; 
pixel_depth; 
image_coord; 


} TARGA_HEADER; 


0 


1 
2 
3 


file id length (follows header, 
0 = no color map, 1 = color map 
1 = uncompressed, color map 
starting color map index 
number of color map entries 
bits/color map entry 
starting x position 
starting y position 
image width in pixels 
image height in pixels 
image bits per pixel (8, 
image start coordinates 


if applicable) 


16, 24) 


// OO - bottom left 
// 1 - bottom right 
// 2 - top left 
// 3 - top right 


/* Restore to WORD alignment */ 
#pragma pack() 


/* Routines in VTGA.C */ 


int get_targa_header(char *filename, TARGA HEADER *header) ; 
int set_targa_palette(char *filename) ; 


int load_targa(char *filename, 
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int x, int y); 
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Glossary of Terms 


A 


alternate-fill XXXXX XXXXX XXXXX XXXXX XXXXX 
XXXXX. 


B 


background 
Bresenham 


C 

clipping 
color source 
context 
CRTC 


D 


destination aligned 
destination compare 
destination trajectory 
direct color 

draw engine 


F 


foreground 


H 


host 
host data 
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linear frame buffer 
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L 


linear frame buffer 


mix 
monochrome 
monochrome source 


O 


offset 


P 


packed monochrome 
paged frame buffer 
palette 

pattern 

pitch 

pixel depth 
pseudocolor 


ROP 


scissoring 

side effect 
source aligned 
source compare 
source trajectory 


T 
tiling 
trajectory 
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Accelerator CRTC and DAC registers E-3 
Accelerator mode 2-12 
Engine initialization 
Sample code 2-13 
Memory aperture 2-12 
Using the BIOS to set 2-3 
Accessing the EEPROM 5-2 
Sample code 
reading and writing from 
EEPROM 5-2 
Advanced topics 1 4-1 
CRT synchronization 
Single buffering (synchronized) 4- 
22 


Advanced topics 2 5-1 
Advanced topics1 

CRT synchronization 

Double buffering (memory) 4-15 

Application performance 6-5 
ATIM64.H F-5 
ATIM64.INC F-1 
ATTR.C F-9 


B 


Backward compatibility 1-2 
BIOS interface 2-3 
Sample code 2-4 
BIOS services A-1 
Function 0, load accelerator CRTC 
parameters A-1 
Function Oah, return clock chip 
frequency table A-4 
Function OBh, program clock chip A-4 
Function 0Ch, set DPMS mode A-4 
Function 0Dh, return current DPMS 
state A-5 
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Function 0Eh, set graphics controller’s 
power management state A-5 
Function OFh, return graphics 
controller’s current power 
management state A-5 
Function 1, set display mode A-2 
Function 10h, set RAMDAC state A-5 
Function 11h, return external storage 
device information A-5 
Function 12h, Short query A-6 
Function 2, load accelerator CRTC 
parameters and set display 
mode A-2 
Function 3, read EEPROM data A-2 
Function 4, write EEPROM data A-2 
Function 5, memory aperture services 
Function 6, return hardware capability 
list A-3 
Function 6, short query function A-3 
Function 8, return size of device query 
data structure A-4 
Function 9, device query A-4 
Bitblt 3-2 
Bitblt, blit 2-12 
Blits 2-57 
Transparent 4-13 
Block write 6-3 
Boot-time initialization 5-1 


Cc 


Chained contexts 2-47 
Clock chip reference 
ATI 18811-0, ATI18811-1, ATI18811-2 
D-1 
Command FIFO 2-30 
Sample code 2-30 
Waiting for draw engine idle 2-31 
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Waiting for engine idle 
Sample code 2-31 
Waiting for sufficient FIFO entries 2- 
30 
Compilers 
Assembler 
Microsoft MASM compiler F-1 
Cc 
Microsoft C compilers F-1 
Concurrency 6-1 
Constants 1-4 
Context chains 4-48 
Sample code 
Chain a blit and a line and halt 4- 
48 
CRT controller (CRTC) 2-3 
CRT mode 
Designing a custom CRT mode 2-8 
CRT parameter table B-4 
CRT parameters C-1 
1024x768 56Hz non-interlaced C-7 
1024x768 60Hz non-interlaced C-8 
1024x768 66Hz non-interlaced/72 C-9 
1024x768 66Hz non-interlaced/75 C-8 
1024x768 70Hz non-interlaced C-9 
1024x768 72Hz non-interlaced C-10 
1024x768 76Hz non-interlaced C-10 
1024x768 87Hz interlaced C-7 
1120x750 60Hz non-interlaced C-11 
1120x750 70Hz non-interlaced C-12 
1120x750 87Hz interlaced C-11 
1280x1024 60Hz non-interlaced C-13 
1280x1024 70Hz non-interlaced C-14 
1280x1024 74Hz non-interlaced C-14 
1280x1024 87Hz interlaced C-12 
1280x1024 95Hz interlaced C-13 
640x480 60Hz non-interlaced C-1, C-2 
640x480 72Hz non-interlaced/32 C-2 
640x480 72Hz non-interlaced/40 C-3 
800x600 60Hz non-interlaced C-5 
800x600 70Hz non-interlaced C-5 
800x600 72Hz non-interlaced C-6 
800x600 76Hz non-interlaced C-6 
800x600 89Hz interlaced C-3, C-4 
800x600 95Hz interlaced C-4 
CRT synchronization 4-15 
CRTC compatibility 2-3 


D 
DAC programming 5-3 
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Sample code 5-3 
DACs 5-3 
Delta framing 4-46 

Sample code 4-46 
Designing a custom CRT mode 2-8 

Pixel clocks 2-10 

Sample code 2-11 
Destination trajectories and blits 2-12 
Destination trajectory 1, rectangular 2-38 
Destination trajectory 2, line 2-39 
Detecting presence of a maché4 2-1] 
Diagnostic features 5-4 

Other test features 5-5 

Test mode 0, all test features disabled 

5-4 


Test mode 1, memory read /write test 


Test mode 2, source and destination 
length test 5-4 
Test mode 3, source FIFO read length 
counter test 5-4 
Test mode 4, CRTC test 5-5 
Test mode 5, display CRC test 5-5 
Direct color modes 2-49 
Double buffering (memory) 4-15 
In the interrupt service routine for the 
system timer 4-15 
In the mainline application 4-15 
Sample code 4-16 
Double buffering (palette) 4-22 
DRAM 6-2 
Draw engine 2-12 
Destination trajectories and blits 2-12 
Draw engine context control registers E-4 
Draw engine contexts 2-47 
Chained 2-47 
Draw engine initialization 2-13 
Sample code 2-13 
Draw engine trajectory control registers E- 


Draw operations 2-57 

Blits 2-57 

Lines 2-57 

Style 1, simple drawing without using 
contexts 2-57 

Style 2, default context loads for every 
style of draw operation 

Sample code 2-58 

Style 2, drawing using restored 
contexts 2-58 

Style 3, all draw operations done with 
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context chains 
Context chains 2-61 
Style 3, drawing using context chains 
2-61 
Draw speed 6-1 
DRAW.C F-38 


Drawing in packed 24 bit per pixel mode 4- 
‘| 


Drawing Polygons 4-2 
Drawing polygons 4-1 


E 


EEPROM 
Accessing 5-2 
EEPROM data structure B-1 
EEPROM map B-1 
CRT parameter table B-4 
Accelerator parameters B-4 
VGA parameters B-4 
EEPROM data structure B-1 
Efficiency 6-2 
Expansion buses 6-2 
EISA 6-2 
ISA 6-2 
MCA 6-2 
PCI 6-2 
VLB 6-2 


F 


Features 1-1] 
Fixed patterns 3-9 
Sample code 3-9 


G 


General pattern 3-3 
Sample code 3-3 

General pattern with rotation 3-4 
Sample code 3-4 

GLOB.C F-26 


H 


Hardware cursor 2-49 
Pseudo color modes and direct color 
modes 2-49 
Sample code 2-50 
Host data consumption 2-35 
HWCURSOR.C F-20 
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Memory bandwidth 6-3 
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INIT.C F-46 
Initialization 
Boot-time 5-1 
BUS_CNTL 5-1 
CONFIG _CHIP_ID, 
CONFIG_STATO, 
CONFIG_STAT1 5-1 
CONFIG_CNTL 5-1 
GEN_TEST_CNTL 5-1 
MEM_CNTL 5-1 
SCRATCH_REGO, 
SCRATCH_REG1 5-1 
Interrupts 4-12 


L 


Line draw 3-11 
Sample code 3-11 
Line patterns 3-7 
Sample code 3-7 : 
Linear and paged memory apertures 2-18 
Big aperture 2-24 
Sample code 2-24 
PCI bus implementation 2-18 
Small apertures 2-18 
Sample code 2-19 
Standard 64k VGA aperture 2-18 
Lines 2-57 
Logical pixel data path 2-32 


M 


mach64 accelerator 
Deletions from mach32 1-3 
Detection 2-1 
Sample code 2-2 
Functional differences from mach32 1- 
3 
Functional enhancements to mach32 
1-2 
Major features 1-1 
Overview 1-1 
Relationship to previous ATI 
accelerators 1-2 
Sample code organization 1-4 
Manual mode switching 2-8 
Memory aperture 2-12 
Memory bandwidth 6-3 
Example 6-4 
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Memory management 
Off-screen 4-47 
Sample code 4-47 
Memory mapping E-6 
MEMREG.C F-59 
Mode switching 2-3 
BIOS interface 2-3 
Setting accelerator mode 2-3 
CRT controllers 2-3 
CRTC compatibility 2-3 
Designing a custom CRT mode 2-8 
Manual 2-8 
Non-volatile storage 2-8 
Modes 
Switching 2-3 
monochrome expansion bitblt 3-5 
Sample code 3-5 
MOVEMEM.ASM F-65 


N 


Non-volatile storage 2-8 


O 


Off-screen memory management 4-47 
Operating modes 

Accelerator mode 2-1, 2-12 

VGA mode 2-1 
Overview 1-1 


P 


Packed 24 bit per pixel mode 
Drawing in 4-7 

PALETTE.C F-69 

Pattern consumption 2-36 

Performance issues 6-1 
Application performance 6-5 
Block write 6-3 
Concurrency 6-1 
Draw speed 6-1 
Efficiency 6-2 
Expansion buses 6-2 
Memory bandwidth 6-3 

Example 6-4 

Redundancy 6-1 
System performance 6-5 
VRAM vs DRAM 6-2 

Pixel clocks 2-10 

Pixel depth 2-64 
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Drawing 4-1 
Sample code 4-2 


Programming DACs 5-3 
Programming model 2-1 


Accelerator mode 
Draw engine 2-12 
Engine initialization 2-13 
Memory aperture 2-12 
Command FIFO 2-30 
Draw engine contexts 2-47 
Chained 2-47 
Draw operations 2-57 
Hardware cursor 2-49 
Linear and paged memory apertures 
Big aperture 2-24 
Small apertures 2-18 
Standard 64k VGA aperture 2-18 
Logical pixel data path 2-32 
Host data consumption 2-35 
Pattern data consumption 2-36 
Memory apertures 
Linear and paged 2-18 
Operating modes 
Accelerator mode 2-12 
Pixel depth 2-64 
Register mapping 2-27 
Source and destination alignment 2-44 
Source and destination mixing logic 2- 
46 
Trajectories 2-37 
Destination trajectory 1, 
rectangular 2-38 
Destination trajectory 2, line 2-39 
Source trajectory 1, strictly linear 
2-40 
Source trajectory 2, unbounded Y 
2-40 
Source trajectory 3, general pattern 
2-41 
Source trajectory 4, general pattern 
with rotation 2-42 
Trajectory modifier 1, 
SRC_BYTE_ALIGN 2-43 
Trajectory modifier 2, 
DST_POLYGON_ENA 2- 
43 
Trajectory modifier 3, 
DP_BYTE_PIX_ORDER 2- 
43 
VGA interaction 2-65 
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Rectangle fill 3-1 
Sample code 3-1 
Redundancy 6-1 
Register mapping 2-27 
Sample code 2-27 
Register Summary 
Draw engine trajectory control E-5 
Register summary E-1 
Accelerator CRTC and DAC E-3 
Draw engine context contro] E-4 
Setup and control E-2 
VGA E-1 
Registers 
Memory mapping E-6 
ROMCALLS.ASM F-72 


S 


Sample Code 
Drawing Polygons 4-2 
Sample code 
Accessing the EEPROM 
Reading and writing from 
EEPROM 5-2 
Assembler 
ATIM64.INC F-1 
Microsoft MASM compiler F-1 
MOVEMEM.ASM F-65 
ROMCALLS.ASM F-72 
VINT.ASM F-82 
Big memory aperture 2-24 
Cc 


ATIM64.H F-5 
ATTR.C F-9 
DRAW.C F-38 
GLOB.C F-26 
HWCURSOR.C F-20 
INIT.C F-46 
MEMREG.C F-59 
Microsoft C compilers F-1 
PALETTE.C F-69 
SAMPLE.H F-27 
VTGA.C F-89 
WAIT.C F-63 

C and assembler 1-4 

Command FIFO 2-30 

Constants 1-4 
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Setting accelerator mode using the BIOS 2-3 


Context chains 
Chain a blit and a line and halt 4- 


DAC programming 5-3 
Delta framing 4-46 
Double buffering (memory) 4-16 
DP_WRITE_MASK register 3-15 
Draw engine initialization 2-13 
Draw operations 2-57, 2-58, 2-61 
Fixed patterns 3-9 
General pattern 3-3 
General pattern with rotation 3-4 
Hardware cursor 2-50 
header 
VTGA.H F-95 
Line draw 3-11 
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Scrolling and panning 
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desktop 4-8 
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BIOS 2-4 
Simple one-to-one bitblt 3-2 
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Source and assembler F-1 
Source and destination mixing 3-16 
Template 1-4 
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VINT.H F-88 
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SAMPLE.H F-27 
Saving and restoring a context 3-1 
Scissoring and masking 3-14 
DP_WRITE_MASK 
Sample code 3-15 
Sample code 3-14 
Scrolling and panning 4-8 
Sample code 
640x480 screen on 1024x768 
desktop 4-8 
Setting accelerator mode using the BIOS 2- 
3 
Sample code 2-4 
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Setup and control registers E-2 U 
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Bitblt 3-2 Utility functions 1-5 
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General pattern with rotation 3-4 V 
Line patterns 3-7 
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Test mode 0, all test features disabled 5-4 
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